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Preface to the second edition 


In revising and enlarging the first edition I was greatly helped by comments 
and criticisms from several colleagues, but in particular Dr M. E. Barnett, Dr 
G. A. Brooker, and Dr L. J. Cox; needless to say, they may not recognize the 
results of some of their criticisms and they are in no way to be held responsible 
for what is in this second edition. The principal changes are, the addition of a 
short section on the speed of light, several additional sections on aspects of 
geometrical optics, some expansion of the chapter on laser light, and, lastly, a 
whole new chapter on optical light guides. This latter is, of course, pitched at a 
very elementary level and is not to be regarded as even the briefest 
introduction to, say, the use of optical fibres for communication; rather, it is 
intended to illustrate many of the ideas and principles developed in the 
previous seven chapters in a context which is of wide interest. 


Imperial College London W.T.W. 


May 1980 


Preface to the first edition 


Present-day physics courses are under increasing pressure, on the one hand to 
keep up with developments in fundamental physics and on the other to cover 
a broad range of topics appropriate to the interests of students who may never 
become professional physicists. Thus the time available for optics in the first 
or second year of an undergraduate course, as for other branches of physics, 
decreases, and this has influenced my choice of topics in this book; I have 
been very selective and, as can be seen from the contents list, I have chosen 
material which is either basic to the development of the optics of the visible 
spectrum or which has interesting links with other kinds of optics or other 
branches of physics. Some may be concerned about what is not to be found in 
this book, e.g. measurement of the speed of light, group velocity, standing 
waves, the envelope function for diffraction gratings, refractometry, Fresnel 
diffraction, and phase-change effects in interferometry. These omissions 
might have been dictated anyway by the agreed size of the Oxford Physics 
Series texts, but I do not plead this as an excuse. The book as it stands is 
intended as a reasonable selection of topics to be presented to under- 
graduates, perhaps in their first term at University and certainly having to 
cope with many other new things at the same time. 

I have tried to stress physical arguments, and in order to reduce the 
mathematical complexity I have introduced the concept of a complex 
amplitude in the first chapter. I have also used the formalism of Fourier- 
transform theory freely, since this illuminates and simplifies every branch of 
physics in which waves appear; this may seem rather extreme for an 
elementary text, but since simple experiments with lasers are most easily 
discussed in terms of Fourier transforms it seems almost certain that students 
will meet the transform in their laboratory work and will grasp the basic ideas 
even if they have not been Presented with a systematic formulation. However, 
Sections 5.5 and 6.6 contain some more difficult Fourier-transform material, 
which could be omitted in very early courses. The main definitions and 


theorems of Fourier-transform theory needed are given, without proofs, in 
the Appendix, 


Some of the 


i problems at the end of each chapter amplify the text by 
introducing sim 


ple extensions of the main discussion. 


Preface vii 


Ishould like to thank my colleagues Dr M. E. Barnett and Dr R. W. Smith 
for their help with this book, mostly given unknowingly; many of their ideas 
about the teaching of optics have gone into it. Also I am very grateful to 
Professor E. J. Burge, who read the first draft, gave very valuable criticism, 
and made many useful suggestions, and to Miss Lesley Harwood, who 
prepared the index; and I thank the staff of Oxford University Press for their 
help during publication. 

The quotations from James Joyce’s Ulysses are by kind permission of the 
Society of Authors, as the literary representative of the Estate of James Joyce, 
and of The Bodley Head, as publishers. 


Imperial College London W.T.W 
1975 
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1. Waves, rays, and particles 


But what I am anxious to arrive at is it is one thing to invent for instance those rays 
Réntgen did, or the telescope like Edison, though I believe it was before his time, 
Galileo was the man I mean. The same applies to the laws, for example, ofa far reaching 
natural phenomenon such as electricity .. . 


James Joyce: Ulysses 


1.1. The electromagnetic spectrum 


For many purposes optics can be regarded as the study of visible light, 
although in fact this light forms but a small part of a great range or spectrum 
of radiation. The most familiar part of this spectrum (apart from visible light) 
is probably the radio region (wireless waves). The complete spectrum of 
electromagnetic waves is described in Chapter 1 of Radiation and quantum 
physics (OPS 3) by D. J. E. Ingram. The waves are classified according to 
their wavelength 4 or their frequency v and these are related by 


Àv = speed of the wave. (1.1) 


Electromagnetic waves of all frequencies have the same speed in vacuum, 
approximately 3.10° m s~!: this universal constant is denoted by c. 

We shall begin by describing light and other parts of the electromagnetic 
spectrum as electromagnetic waves, but this is only one possible description; 
light (as all other regions of the spectrum) has many properties which are 
better discussed in terms of other representations (e.g. rays or particles), and 
we shall have to consider these also. 

An electromagnetic wave can be represented as in Fig. 1.1. The graph 
represents the strength of the electric field in the wave at a given instant and at 
different points along the direction z of travel. Figure 1.2 shows the same 
thing in a more picturesque way; the closeness of the lines indicates the 
relative strength of the electric field. Thus Figs. 1.1 and 1.2 can be regarded as 
snapshots of the wave in space, taken at a certain instant of time. We could 
also look at a single point in space and consider the variation in time of the 
electric field at that point; we should then have a graph like Fig. 1.3. 

A more complete picture would be obtained by making the graph of 
Fig. 1.1 move to the right along the z-axis at the speed c of the wave. The field 
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strength 
E 


Zo COS (272/4) 


Distance = 


FIG.1.1. | Theelectric field strength in an electromagnetic wave at a given instant as a 
function of the propagation distance z. 
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FIG. 1.2. The amplitude of a wave. The closeness of the lines represents the field 
strength and broken lines indicate negative amplitudes. 
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Timer 


FIG. 1.3. 


? The electric field strength in an electromagnetic wave as a function of 
time r. 


Strength at any point as time passes would then vary as in Fig. 1.3. This 


travelling wave then has electric field strength E at any distance z and any 
time t given by 


E = Ey cos 2n(vt — 2/2). (1.2) 


This is easily verified by keeping t or z constant and comparing with the 
expressions in Figs. 1.1 and 1.3 respectively. A wave travelling to the left, i.e. in 
the negative z direction, would have a positive sign in the argument of the 
cosine. 


To complete the picture of electromagnetic wave we ought to consider also 
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the accompanying magnetic field. But here it is sufficient to note that the 
magnetic field has a similar sinusoidal variation and that in the simplest 
situations, where the wave is not transferring energy to the medium through 
which it is travelling and where all parts of the wave are travelling in the same 
direction, the magnetic field varies in step or in phase with the electric field; 
both fields are at right-angles to the direction of travel of the wave and the 
electric and magnetic fields are perpendicular to each other. 

Different sections of the electromagnetic spectrum are produced and 
detected in different ways, and the waves have a variety of interactions with 
matter, (see Radiation and quantum physics (OPS 3)). Although we shall be 
mainly concerned with visible light, it is easiest to consider also the properties 
of radio waves. This is because many of the properties we shall be interested 
in—those which produce interference and diffraction effects—can be dem- 
onstrated and explained for radio waves with fewer complications than for 
visible light. 


"1.2. Power and energy 


Anessential property of all waves is that they transfer energy (from a source to 
adetector) without transferring the medium in which the waves occur. Indeed 
it is doubtful whether there can be said to be a ‘medium’ for electromagnetic 
waves. Thus the rate of energy flow or the power in a wave is of interest. It 
follows from the detailed study of electromagnetic waves that for a wave like 
that in Figs. 1.1-1.3 the power density (i.e. power per unit area across the wave 
transmitted in the direction of propagation) is proportional to the square of 
the electric field strength. We shall take this result as our starting point for a 
discussion of energy flow; it is treated in detail in texts on electromagnetism 
(e.g. Electromagnetism (OPS 1) by F. N. H. Robinson), where derivations 
and conditions of applicability can be found. Thus from (1.2) the power 
density is proportional to 

E2{1 + cos 4n(vt — 2/)}. (1.3) 


ne term causes a periodic fluctuation in energy flow across a 
0. The oscillating electric field induces an alternating 
and this constitutes detection of the 


Clearly the cosi 
certain plane, say z = 
voltage in a conductor (antenna), 
electromagnetic wave. i ‘ 

One of the major differences between electromagnetic waves at radio and 
at optical (and higher) frequencies is that we have no detectors which can 
respond fast enough to demonstrate optical frequencies directly. In fact the 
fastest detectors of light will respond only to frequencies of the order of 
10°-10!° Hz, some 5 orders of magnitude too low. Thus any detector of 
ation in the optical range responds only to the average 


electromagnetic radi ; 
averaged power is thus (from 


power over many cycles of the waves. This time- 
eqn (1.3)) proportional simply to Eo. 
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1.3. The complex exponential notation and the complex amplitude 


Another basic property of electromagnetic waves is that if two or more wave 
systems cross in a certain region of space, the electric and magnetic field 
strengths in this region are found simply by adding as vectors the fields from 
the individual wave systems. Thus we find the effect of overlapping waves by 
adding their field strengths or by linear combination. This simple result is not 
true for very large field strengths, and the topic of nonlinear optics has 
developed in the last decade now that such field strengths are available at 
optical frequencies. However, in this book, we shall assume linear combi- 
nation or superposition. 

Both interference and diffraction phenomena can be explained in terms of 
superposition of waves, and in this section we shall discuss the mathematical 
symbolism for this. 

Suppose we have two electromagnetic waves of the kind described in 
section 1.1, travelling at an angle 0 to each other, as in Fig. 1.4. Let the two 
waves have the same frequency (and therefore the same wavelength) and the 
same maximum field strength Eo. If we use axes as in the figure we can write 
the two waves as 

Eg cos 2n(vt — z/2), (14) 
Eo cos 2n{e + vt — (z cos 0 + y sin 0)/A}. 


In the expression for the second wave the constant £, known as a phase-shift 
term, allows for the Possibility that the two waves are not in step at the origin 
of the coordinate system, and the expression z cos 0 + y sin 0 ensures that the 
lines of constant electric field, or wavefronts, are at an angle @ to the y-axis. 
To fix our ideas we can regard each of the parallel lines in the figure as 


representing maximum field at a certain instant of time, but this is not 


essential. In order to find the interference field, as it is called, in the region 
where the waves cross we have to 


: add the two expressions (1.4). If we are 
dealing with optical frequencies we can only observe the time-averaged power 
density, which is, of course, what we ordinarily know as the light intensity, 
and so we have to square the sum of the two expressions in eqn (1.4) and find 
the time-average. There is no fundamental difficulty in doing this, but the 
manipulation of the trigonometrical expressions is very involved, particularly 
if we want to consider more than two waves and if they all have different field 
Strengths. This has led to the introduction of the complex exponential 
notation and the use of the complex amplitude to describe waves, as follows. 
First we replace an expression such as that in eqn (1.2) by 


E = Ey exp 2ni(vt — 2/A), 
where i is, of course, ,/ 
Superposing waves, 
concerned only with t 


— 1. We shall add complex expressions of this kind in 
but with the understanding that we are actually 
he real parts. Since real and imaginary always remain 


Waves, rays, and particles 5 


FIG. 1.4. | The superposition of two electromagnetic waves travelling in directions at 
an angle @ to each other. 


separate in summations, this is valid. The above expression represents a wave 
with plane wavefronts travelling in the z-direction. We can now represent a 
similar plane wave, travelling in an arbitrary direction specified by a unit 
length vector a, by the expression 


E = E, exp 2ni(vt — a*r/A), 
where r = (x, y, x) is the vector from the origin to an arbitrary point in space. 


We can check that this agrees with the second of eqn (1.4) by expanding the 
scalar product and remembering that the components of a unit vector are 


direction cosines. 

Next we put 2zv = w, the angular frequency, and we put 27a/A = k. k is 
called the wave-vector, and we shall also use the scalar |k| = 27/2, which we 
denote by k and call the wave-number. Thus our expression for a plane wave 


is 
E(t, r) = Eo exp i(e + wt — ker). (1.5) 
We have now indicated explicitly that the field strength E is a function of the 


position r and the time t, and we put in an arbitrary phase shift e. We get the 
effect of superposing n waves of this kind by adding the appropriate terms, 


E E, exp ile, + ot — kyr), 


n 
or, taking out the common factor exp iwt, since we have supposed all the 
waves to have the same frequency, 


exp iot £ E, exp i(é, — kẹ’ T). 


We can write the summation, which is independent of the time, as R + il, 
where R and J are two real functions of the position vector r. From section 1.2 
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the intensity in the wave-field is the time-average of the square of the real part 
of 


(R + il) exp iwt, 
i.e., the time-average of 
(R cos wt — I sin wt)?. 
It is easily verified that this time-average is simply 4(R? + 1°). The factor } is 
usually dropped. 

In this calculation the time-dependence of the waves appeared as a 
common factor exp ict to all terms, which vanished in the final time- 
averaging; and the final intensity R? + I? is simply the squared modulus of 
the summed complex expressions. 

Thus we have the rule that, to find the intensity due to several superposed 
plane waves of the same frequency, we add terms of the type 


E, exp i(¢, — k,*r) for the individual waves and take the squared modulus at 
the end to find the intensity. An expression of the type 


E exp i(e — kr), 


in which the time-dependent part is omitted, is called a complex amplitude. 
These quantities can also be used for superposing other than plane waves (i.e. 
convergent or divergent waves), and for calculations with all forms of wave 
motion, not only electromagnetic waves. It is only necessary that the waves all 


have the same frequency. As a trivial example, the complex amplitude of the 
wave in eqn (1.2) is 


Eo exp (—2niz/A), 


and the intensity is therefore immediately E3. If we now apply the procedure 


to the two waves of eqn (1.4) we easily find, for the intensity in the plane z = 0, 
the expression 


2E3(1 + cos {(22/A)y sin 0}). 


This is a typical two-beam interference expression; we shall examine it more 
closely in Chapters 3 and 6. 

As we noted earlier, the intensity, which has dimensions of power per unit 
area, is strictly proportional to Ej, i.e. in our present terms it is proportional 
to the squared modulus of the complex amplitude. The proportionality 
constant is important both for its dimensionality and for its numerical 
magnitude in connection with radio wave and microwave theory, but it is not 
important in the optical problems that we shall encounter. Thus for many 
Purposes we can ignore the electromagnetic nature of light and discuss its 
Properties in terms of a complex amplitude of some undefined quality or 
medium. Often we need not even specify whether the wave motion is 
transverse (electromagnetic waves or waves on a string) or longitudinal 
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(sound waves in air). This apparently abstract approach has advantages: 
parallels with other kinds of wave can be drawn, and we shall find it easier to 
come to terms with the fact that even the electromagnetic theory is not 
adequate to explain all optical phenomena. 

It is found that all kinds of waves have to be characterized by two different 
quantities. These are of widely differing physical natures, depending on the 
kind of wave, but in all cases there is an amplitude, which varies in time and 
space and gives interference effects, and an intensity, which represents the rate 
of energy transport. With suitable interpretations the complex amplitude and 
its squared modulus, the intensity, can be used in all cases. All interference 
experiments and many diffraction experiments can be described in these 
terms. 


1.4. Sources and detectors 


The production and detection of different parts of the electromagnetic 
spectrum are described in Radiation and quantum physics (OPS 3). Many of 
the effects and techniques which we usually call ‘optical’ apply mainly to the 
infrared, visible, and ultraviolet regions. In these regions there are three main 
kinds of source: 


(1) thermal sources which produce a continuous spectrum, e.g. solid hot 
bodies, such as filament lamps, and hot gases under high pressure, as in 
an electric discharge through xenon (e.g. a flash tube); 

(2) thermal sources giving line spectra, e.g. mercury vapour or neon 
discharge tubes under low pressure; 

(3) lasers. 


The first two are sometimes said to emit thermal or chaotic light; the latter 
term reminds us that the phase relationships between light emitted by 
different atoms or molecules are quite random, whereas in lasers the atoms 
emit in phase with each other. A continuous spectrum emitted by a source of 
the first kind may sometimes approximate to black body radiation (Radiation 
and quantum physics, OPS 3). 

We can describe the production and detection of radio waves quite well in 
terms of the classical theory of electromagnetism, i.e. without invoking the 
ing quantum theory. However, in the optical 
region of the spectrum we have to introduce quantum concepts in order to 
explain light production and detection, although effects concerned with 
propagation alone (e.g. interference and diffraction) can be described in terms 
of a simple wave theory, usually involving only the use of the complex 
amplitude. 

The quantum theory of light emission and absorption is explained in 
Radiation and quantum physics (OPS 3). Here we need only note that 
electromagnetic radiation is emitted or absorbed in finite quanta of energy 


existence of electrons or us 
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called photons. The amount of energy in a photon depends on the frequency of 
the radiation and is given by 


E=hv=he/i (1.6) 


where h, the Planck constant, is 6.626 x 1073+ Js. The energy per photon is 
sometimes given in electronvolts (eV); 1 eV is 1.602 x 10~!° J. The emission 
or absorption of a photon corresponds to a change in the energy of an atom, 
molecule, or other system. In the infrared these transitions are between 
rotational or vibrational states of molecules; in the visible and near 
ultraviolet they correspond to changes in the energy levels of electrons in the 
outer orbits of an atom; and in the far ultraviolet and X-ray regions the inner 
electrons are involved. These are progressively greater changes in energy of a 
molecule or atom, and so they produce more energetic photons. This affects 
the mode of detection. A far ultraviolet photon has enough energy to ionize a 
gas atom or molecule, and it can be detected by an ionization chamber; 
alternatively it can cause photoelectric emission of electrons from almost any 
conductor, and so it can be detected by photodiodes. Visible-light photons 
can produce photoelectrons from particular surfaces (photocathodes), so 
that they can also be detected photoelectrically; they can produce a latent 
image ina Photographic emulsion (a complex process which is not yet fully 
understood), and they can produce photochemical reactions, one of which is 
the starting point of the process of vision. 
We call all the above Processes quantum detection processes, because they 
involve a change in state of an individual atom or molecule in a detector by a 
single quantum. As we move up the wavelength scale into the infrared, fewer 
quantum detectors are available and the radiation is detected by its general 
heating effect when it is absorbed, e.g. by a thermocouple or by a resistance 
thermometer (thermal detection processes). The main reason for this dif- 
ference is that, in the infrared, the amount of energy in a photon is so small as 
to be comparable to the average random energy of thermal motion of the 
atoms or molecules in a detector. This thermal energy is of order kT, where k 
is the Boltzmann constant (1.381 x 10-73 J K~!) and T is the absolute 
PRAN Thus at ordinary temperatures the thermal energy is about 
y RAEES should not expect to be able to make detectors depending on 
S u m effect for photons of energy less than or comparable to this value. 
he area en dni nna a ey oligo 
rocedi ert nw ich the limitation is circumvented by specia 
» Out, broadly speaking, the small photon energy in this region of 


the spectrum is the reason fi Eo mest 
or the distinc a rmal 
detectors. tion between quantum and therma 


Ponda merespons to eeaomsentic radiasion 
as 107° s: the TARE tipliers respond to changes taking place as rapidly 
effect known tousa oe See changes slower than about 0-05 s, an 

S Persistence of vision; and photographic detectors can 
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add or integrate light flux for perhaps several hours, until a certain saturation 
of exposure has been reached. We can also classify detectors according to 
whether they record images or total flux. Photomultipliers, ionization 
chambers for X-rays, and thermocouples for the infrared are total flux 
detectors; but the eye, the photographic plate, and television camera tubes 
record detailed images, i.e. they are flux density detectors. 

It is often important to know whether a detector is linear in response. The 
output of a detector can take many different forms, such as a voltage, a 
current, blackening of a photographic emulsion, or a visual impression in the 
brain. If we refer to any of these as the ‘signal’ the detector is said to be linear in 
response if the signal is directly proportional to the flux falling on the 
detector. Thus the signal corresponding to the sum of two fluxes is the sum of 
the individual signals. Photocells and photomultipliers are linear over many 
orders of magnitude if they are used with suitable circuits, but photographic 
detectors are usually not. The eye is very non-linear; the sensation of 
brightness is roughly proportional to the logarithm of the light intensity. 

A final interesting property of detectors is their spectral range of sensitivity, 
or working range. This is roughly indicated in electromagnetic spectrum 
charts (see Radiation and quantum physics, OPS 3), but more information 
can be provided by a graph. Such a graph may take several forms: (1) we can 
plot the output signal per unit wavelength interval which the detector would 
give if used with a fictitious source giving the same energy flux per unit 
wavelength interval all over the spectrum; (2) we could make a similar plot 
but per unit frequency interval for detector and source; (3) for a quantum 
detector we can plot the reciprocal of the average number of photons required 
to produce one photoelectron as a function of wavelength or frequency, this 
being called the quantum efficiency. Figure 1.5 shows spectral-sensitivity 
curves for the eye and for some photocells. The difference in response over 
even the narrow range of the visible spectrum complicates the comparison of 
responses of different kinds of detectors. For example, it can be seen that ifwe 
had beams with equal powers in watts (W), but of violet and green light, an 
antimony-caesium photocathode would suggest that the violet was the more 
intense, but an eye would indicate the reverse. This has led to the use ofa 
special system of visual photometric units applicable to the eye in which the 
lumen is the unit of flux (see Welford 1962); a lumen is the equivalent of about 
1.467 x 1073 W of green light or about 1.2 W of violet light of wavelength 
0.410 um. In this book we shall not use this visual system of units, since it 
to one detector; light flux will be measured either in watts 


applies specifically T 
e wavelength or frequency specified. 


or in photons per second, with thi 


1.5. Monochromatic and polychromatic fields 


We must now consider an essential difference between radio waves and light 
waves. In sections 1.1 and 1.2 we represented a radio wave as a sinusoidal 
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FIG. 1.5. Spectral sensitivity of the normal eye (curve 1), of an antimony-caesium 
photocathode (curve 2) and of a sodium-potassium-antimony-caesium photocathode 
(curve 3). The ordinate scale for the eye, on the left, is in arbitrary units, scaled to unity 
at the maximum sensitivity. The scale for the photocathodes, on the right, is in 
milliamperes of Photocurrent per watt of incident light power. 


variation of electric field (see e.g. eqn (1.2)), so that if a detector with 
sufficiently rapid response were stationed at a fixed point in the field the 
output signal would be a strictly sinusoidal or simple harmonic function of 
time. This is very accurately true for unmodulated radio waves, e.g. the simple 
carrier wave for radio or television. However, it is not generally the case for 
visible light. If we could examine the light vibrations carefully over a sufficient 
number of cycles of the vibrations (and there are indirect ways of doing this), 
we should find that, although the vibration seems to be simple harmonic for 
Short lengths of time, when examined over longer periods the amplitude 
Varies irregularly and the maxima and minima do not recur at exactly equal 


time-intervals, Figure 1.6 suggests this effect of random variation of phase 
and amplitude in a beam of light. 


The cause of the tandomne: 
generated by an alternating c 
and it is as if all the elect 


Ss in a light beam is easily seen. A radio wave is 
urrent (i.e. a stream of electrons) in a conductor; 
Tons are in step all the time, to a very close 
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FIG. 1.6. An almost monochromatic wavetrain. 


approximation. However, a beam of light is the summation of a large number 
of elementary waves (i.e. photons) emitted by atoms or molecules, and in 
general there is no fixed relationship between the times at which the different 
photons are emitted. Thus the instantaneous amplitude in the beam of light is 
obtained as the resultant of many independent waves of random phase but 
the same frequency. 

Under certain conditions we can still regard the light beam as simple 
harmonic, and then it becomes fairly easy to discuss interference and 
diffraction. It is for this reason that we said earlier that it is simplest to speak 
in terms of radio waves first. The two kinds of wave motion we have described 
may be called monochromatic and polychromatic. 

The actual lengths of time for which light beams can be regarded as simple 
harmonic vary greatly. At one extreme we have ‘white light’ from, for 
example, a tungsten filament lamp. Such light contains a continuous range of 
wavelengths—as was shown by Isaac Newton, using a prism—and since the 
wavelengths have random phases we would not expect the simple harmonic 
property to persist for more than a few periods, ie. about 107'*s. In a 
‘monochromatic’ beam, e.g. one of the spectrum lines from a mercury 
discharge lamp, for several reasons the photons have a (small) range of 
frequencies, and since they are emitted with random phases there is again a 
finite time involved. Depending on the temperature and pressure of the 
discharge the time for which the wave persists as substantially harmonic may 
be some thousands or tens of thousands of periods, i.e. 10~''-107'° s. By 
means of the speed of light (3 x 10° m s~') we can express this in terms of the 
length of the wave train which is substantially simple harmonic in form, i.e. a 
few millimetres to a few tens of millimetres. These times and distances can 
only be stated approximately, since the wave-trains do not change abruptly 
but gradually, as in Fig. 1.6. They are determined by experiments with an 
interferometer (see section 6.2). 

Laser light contains even longer stretches of simple harmonic waves. This is 
because the mode of action of the laser constrains the light-emitting atoms to 
emit photons which are precisely in phase with each other, instead of having 
random phase relationships, as in an ordinary light source. There are still 
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some random fluctuations, but in carefully stabilized lasers the light may be 
truly simple harmonic for times as long as 10~® s, so that the wave-trains are 
about 1000 m long. 

The time for which the wave-train remains simple harmonic is called the 
coherence time and the corresponding distance, i.e. this time multiplied by the 
speed, is the coherence length. These quantities determine the possibility of 
getting certain interference effects. If we have two radio beams from two 
aerials powered from the same radio-frequency transmitter, there is always 
the same phase relationship between the electric fields at a certain point. Thus 
in Fig. 1.7if A and B are the two transmitters, and if P, is4 wavelengths from 
A and 5 wavelengths from B, there will be a powerful signal at P, ina receiver, 


nin Interference between coherent waves. The arcs indicate maxima of the 
pinuak at a given instant. At P, the disturbances reinforce at all times, at P, they 


since the two waves add in phase. However, at Ey 
from A and 5 wavelengths from B, there will be ne 
map regions of maximum and zero signal, and i 
several navigational systems. In optical terms the two sources generate an in- 
terference pattern. When a ship or aircraft travels through the interference 


Ere its changes in position can be found from the maxima and minima 
traversed; ambiguities can be resolved by, for example, setting up two 
interference patterns of two different frequenci 


in the optical rather thanGn thesans es. However, if the sources are 
between the fields at a n oe radio-frequency range, the phase relationship 
thecoherencen x certain point is only constant for times comparable to 

ee time. Since the coherence time for ordinary sources is shorter 
than the response time for the fastest detectors 


f this means that it is impossible 
to Si Š £ 
observe interference between two different light sources. Alternatively, we 


which is 4.5 wavelengths 
arly zero signal. It is easy to 
n fact this is the principle of 
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can say that the interference field varies randomly more rapidly than the 
response time of the detector and so the detector records a time-averaged 
interference field. This is why optical interference experiments are always 
done with light from the same source, split and suitably recombined. 


1.6. Waves, particles, and rays 


Theories of the nature of light have alternated between those involving waves 
and those involving particles. For waves the essential feature is the wavefront 
or surface of constant phase, and for particles it is the ray or particle 
trajectory. Very persuasive experimental evidence is available for both kinds 
of theory: interference and diffraction effects for wave theories: quantum 
phenomena of emission and absorption and the effects of geometrical optics 
for particle theories. 

The present-day view is that we do not understand everything about light 
(or indeed about any other physical effect), and that in order to obtain 
theories from which we can make true predictions of experimental results we 
sometimes have to use wave ideas and sometimes particle ideas. This view has 
arisen during the present century, from the development of quantum 
‘mechanics. According to this theory any particle has wave-like aspects which 
must be appealed to in order to predict results of some kinds of experiment, 
and equally, waves have particle-like aspects which must be used in certain 
cases. In any given optical experiment either the wave or the particle aspect 
must be stressed. 

Figure 1.8 shows the relationships between some of the principal different 
descriptions or models of light; beside each are listed some of the effects for 
which the description is used. We can regard these as a series of approxi- 
mations, starting from the most detailed (and mathematically most elab- 
orate) in terms of photons. Then, if loosely speaking, we allow the Planck 
constant h to tend to zero we arrive at electromagnetic waves; if we neglect all 
but one component of the electric vector we obtain scalar waves; if we let the 
wavelength tend to zero we get rays and geometrical optics; and finally, if we 
assume all angles are very small, we obtain Gaussian or paraxial optics. We 
must not, however, regard, say, Gaussian optics as being in any sense ‘wrong’ 
because so many approximations were involved. It is simply the right 
formulation for solving certain problems, e.g. it is easy to get the well-known 


thin-lens formula 


E a 
from Gaussian optics but it would be rather tedious to derive it by rigorous 


quantum mechanics. In this way optics shows clearly that we have no 


universal physical theory which will explain and predict everything, and that 
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FIG. 1.8. Descriptions of light. 


meanwhile we must be careful to use 
problem. 
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1.7 The speed of light 


The history of the determination of the speed of light starts with an 
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unsuccessful attempt by Galileo; the first measurement came from astro- 
nomical observations and thereafter many famous physicists made 
successively more precise measurements; finally, at the time of writing, an 
international commission is giving serious consideration to a proposal to 
define the speed of light in vacuum (as 299 792 458 m s™!) in terms of 
measured lengths and frequencies! 

Galileo reported in 1638 that he had stationed two men with shuttered 
lanterns some distance apart and had attempted to time the interval between 
directing one man to uncover his lantern and receiving a corresponding 
signal back from the second man. This we now know could not have 
succeeded with the simple equipment then available. In 1675 Römer, a 
Danish astronomer, noticed that the eclipses of the satellites of Jupiter 
occurred at varying intervals as the planet traversed its orbit round the sun 
and he deduced that this was due to the finite speed of light; Römer produced 
the first estimate, some 30 per cent lower than the correct value. Then Fizeau, 
Foucault, Cornu, Michelson, and others invented successively more accurate 
methods of timing light signals over terrestrial distances until by about 1940 
the value was known to about one part in 30000. From about 1950 
measurements on microwaves and radio waves gave a precision of about one 
part in 10° with good agreement with the optical measurements. More 
recently developments in lasers and nonlinear optical techniques have made 
it possible to compare optical and radio frequencies and thus to achieve an 
accuracy of 2 in 10!" in optical frequency determination. Also the wavelength 
of light can be determined to about one part in 10°. Thus both wavelength 
and frequency can be determined much more precisely than speed and so it 
seems reasonable to define c as given by Av. 


Problems 


1.1. Calculate the wavelengths of electromagnetic waves of frequencies 10°, 10°, 10'?, 
and 10!§ Hz. 

1.2. Write down an expression for the complex amplitude of a spherical wave diverging 
from a point source. 

1.3. Calculate the power density in the spherical wave of Problem 1.2, and show that 
this leads to the inverse-square law. 

1.4. Draw graphs of the energy in a photon as a function of (a) wavelength and (b) 
frequency, using logarithmic scales to accommodate the spectrum from X-rays to 
radio waves. 3 

1.5. What are the wavelength and frequency of radiation of which the photon energy is 
of the order of magnitude of the room-temperature thermal energy of atoms? 

1.6. Plot a graph of coherence length against coherence time, and mark on it points 
corresponding to typical sources discussed in Chapter 1. 

1.7. Calculate the quantum efficiencies at wavelength 0-45 um of the two photo- 


cathodes of Fig. 1.5. 


2. Geometrical optics 


He faced about and, standing between the awnings, held out his right arm at arm’s 
length towards the sun. Wanted to try that often. Yes; completely. The tip of his little 
finger blotted out the sun’s disc. Must be the focus where the Tays cross. 


James Joyce: Ulysses 


2.1. The use of geometrical optics 


From the point of view of a pure physicist, geometrical optics is a crude 
approximation for predicting in broad outlines, and with many reservations, 
how electromagnetic waves behave. It can also predict, with similar 
reservations, the trajectories of electrons, neutrons, etc. 

The applied physicist sees geometrical optics very differently. It is his most 
important tool for designing many kinds of optical system. Chiefly these are 
image-forming optical systems for light and for electrons (e.g. optical and 
electron microscopes and astronomical telescopes), but geometrical optics is 
essential for some aspects of the design and use of almost any optical system, 
from a shaving mirror to a single-lens reflex camera. In addition, it is difficult 
to describe interference and diffraction without using some of the ideas of 
geometrical optics, such as mirrors and collimators. 

The basic concept of geometrical optics is a simple, everyday notion—light 
travels in a straight line unless it is reflected, according to a law which seems 
intuitively obvious, or refracted, according to a rather less obvious law. These 
laws can be verified approximately using very simple apparatus, but the 


accuracy with which elaborate optical systems work gives us a very precise 
verification. 


2.2. Rays, wavefronts, reflection, and refraction 


A ray (of light) is a familiar concept, and we have to carry out rather careful 
experiments to show that straight-line propagation of light is not exactly true. 
In geometrical optics we work in terms of rays of light and an associated 
abstraction, the point source of light, giving (as in Fig. 2.1) a bundle or pencil 
of rays emitted in all directions. We admit that light travels at a known finite 
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FIG. 2.1. A geometrical wavefront as a surface of constant phase. 


velocity, so that in Fig. 2.1 we can mark on all the rays the points the light 
reaches in a certain time t after leaving the point source P. If this is supposed 
to occur in a vacuum or—what is usually assumed to be the same thing in 
geometrical optics—in air, these points lie on a sphere of radius ct, as shown. 
If we reverted to a wave picture of light, this sphere would be the surface 
reached by a wave starting out from P at zero time, i.e. it would be the 
wavefront. In the present context we are not strictly concerned with waves, 
but these surfaces can be very useful in geometrical optics, particularly if we 
consider them after the light has passed through lenses or other optical 
components. Strictly speaking they are called geometrical wavefronts, but we 
shall usually call them simply wavefronts. Thus a wavefront is a surface 
reached by the light from a point source in a certain time. 

The geometrical wavefronts are almost coincident with the true phase 
fronts, or surfaces of constant phase according to physical optics, in most 
regions, but near foci and near the edges of shadows there can be considerable 
differences, as we shall see in Section 7.1. 

Clearly, the rays are normals to the wavefronts in Fig. 2.1. It happens that 
this is nearly always true in optical systemst although. we shall not give the 
proof of this here, so that we can imagine a system of mutually orthogonal 
rays and wavefronts propagating through an optical system. 

Rays of light go through lenses and mirrors according to the laws of 
refraction and reflection. The law of reflection, that the incident and reflected 
rays make equal angles with the normal to the reflecting surface and are 
coplanar with it, seems intuitively obvious on almost any theory of light. The 
origin of this law has not been traced; it was known to Euclid, in about 
300 ac. The law of refraction is also concerned with the relationships of the 
incident and refracted rays with the normal to the refracting surface, e.g. the 
surface of a sheet of glass. The two rays and the normal are again coplanar, 
which seems reasonable from symmetry, and the sines of the angles of 


+ With the exception of certain crystals, but these are not usually used in the kind of 
Optics considered in this chapter. 
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incidence and refraction have a constant ratio, as in Fig. 2.2.¢ This ratio 
depends on the materials on either side of the refracting boundary, and also 
on the wavelength. The law of refraction was discovered early in the 
seventeenth century by a Dutchman, W. Snell, and it is therefore called Snell’s 
law. 

At first sight the law of refraction seems obscure and ad hoc. We may 
wonder, why sines rather than tangents, or any other function, of the angles? 
However, in terms of wave theory the form of the law is almost inevitable. 
Consider a beam of parallel rays striking a plane refracting surface as in 
Fig. 2.3, and let PP, be a wavefront of the incident beam which meets the 
surface at P at time zero, say. After a certain time t, P, has reached the surface 
at Q, and P has travelled on to Q. Thus if the velocities in the two media are v 
and v’, we have 


P,Q, = vt, PQ = v't, 


or 
PQ, sin J = vt, PQ, sin I’ = v't, 
Normal 
AIR 
GLASS 
A 
M 
4A 
FIG. 2.2. Snell’s law of refraction, sin J/sin I’ = const. 
FIG. 2.3. 


Snell’s law obtained from wave theory. 


+The law as stated is not true for certain crystals which are anisotropic, i.e. their optical 
and other properties vary with direction inside the crystal (see Chapter 4). 
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from which we obtain by eliminating PQ,, 


sinl v 


sin? v` 
This is Snell’s law. It is usual to put v/c = 1/n, where, as in Chapter 1, c is the 
speed of light in vacuum; n is then called the refractive index. Snell’s law now 


takes the form 
nsin I =n’ sin I’. (2.1) 


The refractive index of a material is a function of wavelength, and for most 
transparent materials in the visible region it lies between ~ 1.3 and ~2.3. The 
above argument leads also to the law of reflection. At this point we can put 
them together and for convenience, call them both Snell's law.7 

Snell’s law can be obtained in an entirely different way, from Fermat's 
principle. This principle states that if light travels from A to B through any 
optical system it will follow a path such that the time of travel is stationary 
with respect to neighbouring, but not physically possible, paths. ‘Stationary’ 
means that the time of travel may be a maximum or a minimum, or may 
simply have zero rate of change, as at a point of inflection. The time function 
at a stationary point could also be behaving like the altitude at the top of a 
mountain pass, a minimum in some directions and a maximum in others (this 
is called a saddle point). Pierre de Fermat first stated the principle in 1657 in a 
form implying that the time of travel is a minimum for the physically possible 
path (‘Nature always acts by the shortest course’), but stationarity is strictly 
more correct, Since the velocity of light in a medium is c/n, where n is the 
refractive index, this principle can be stated in the form 

B 
Í n ds is stationary, (2.2) 
A 


where ds is a differential element of length along any one of the paths from A 
to B. This is illustrated in Fig. 2.4. The integral of n ds, which as we saw is 
proportional to the time of travel of the light, is called the optical path length. 
Thus the optical path length from a point source to all points on a given 
wavefront is constant. 

Fermat’s principle is analogous to the principle of least action proposed by 
Maupertuis (1744) as a foundation for mechanics. For a particle in a 


+ This is not generally accepted usage, but it is very easy to put eqn (2.1) into the form 
of the law of reflection by putting n’ = —n, as a formal device, for a light ray returning 
into the first medium after reflection. This then gives reflection as a special case of 


refraction. 
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FIG. 2.4. Fermat's principle. The full line represents a physically possible ray path 
from A to B and the broken line another path. (a) For lenses. (b) For a medium of 
continuously varying refractive index. 


conservative field of forcet the action is the integral of momentum along the 
trajectory, i.e. fp ds, and the principle states that for this case the action for a 
physically possible path is a minimum. We can relate Fermat's principle and 
the principle of least action by formally making momentum proportional to 
refractive index. This can be justified for the photon model of light and also, 
with a suitable definition of refractive index, for electrons. 

It is possible to derive Snell’s law from Fermat’s principle by finding the 
shortest optical path between points on either side of a refracting boundary 
(see Problem 2.2). Snell's law can be verified by experiments with prisms. The 
law is the basis of optical design of all lens systems, and the fact that they work 
as designed provides extensive experimental verification of the law and thus 
of Fermat's principle. 

, If we apply Snell’s law (eqn (2.1)) to a ray passing from glass of refractive 
index n to air we find that at an angle of incidence in the glass /,, given by 
sin I, = 1/n, the refracted ray ‘emerges’ at an angle of 90° to the normal; for 
larger angles of incidence eqn (2.1) gives sin l’ > 1, and it is found 
experimentally that the light is completely reflected at the boundary—so- 
called total internal reflection. This follows rather simply from the elec- 
tromagnetic theory. The angle I, is called the critical angle. Total internal 
reflection is used in reflecting prisms, of which the simplest and best known is 
the right-angle prism, as in Fig. 2.5. Light is reflected internally with 100 per 
cent efficiency whereas at silvered or aluminized reflecting surfaces there is 
always some loss by absorption of the incident light. The determination of the 


critical angle is the principle of certain refractometers, i.e. instruments for 
measuring refractive index. 


Leni was field the work done on a particle in transporting it from A to Bis 
ERNS a the path from A to B. The force on the particle is the gradient of a scalar 
: ‘valued potential. Gravitational and electrostatic fields are examples of con- 
servative fields, but a magnetic field is not conservative. 
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FIG. 2.5. A right-angle prism; there is total internal reflection at the hypotenuse 
face. 


2.3. Optical images with a thin lens 


The formation of images in the optical region by lenses and mirrors is 
familiar, and it can be demonstrated in many other regions of the 
electromagnetic spectrum. A simple explanation is as follows. Figure 2.6 
shows a plano-convex lens. Light from a point source O on the axis, e.g. a 
pinhole in metal foil with a lamp behind it, produces a diverging pencil oflight 
with spherical wavefronts convex to the lens. The lens is thicker at the axis. 


o 


FIG. 2.6. Formation of an image by a lens. 


Thus the optical path length through it is greatest there, since the refractive 
index of the glass is greater than that of air, and the wavefronts transmitted by 
the lens will be retarded at the centre relative to the edge. Suppose for 
simplicity that the emergent wavefronts are also spherical in shape, as indeed 
they must be by symmetry fora small enough diameter lens. Then they may be 
convex with a longer radius of curvature, because of the greater delay at the 
centre, or, if the shape of the lens and the position of O are suitable, they may 
become concave, as indicated. Drawing the rays as normals to one of these 
wavefronts, we see that they intersect at some point O' on the axis—this is the 


image of O. 


yorrur 1538- 
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We can put this in an alternative but entirely equivalent way as follows. If O 
and O' are to be object and image then all rays which enter the lens from O 
must pass through O’. But Fermat’s principle (Section 2.2) states that all 
those ray paths must be such that the optical path length from O to O’ along 
them is stationary, so that the optical path length from O to O' must be the 
same along all the rays. Thus the greater glass path through the centre of the 


lens compared to the edge is compensated by the shorter length of the rays in 
air at the centre. 


If the object point O is not on the axis a similar argument will show that an 
image point is again formed, with suitable approximations. Thus we get an 
image of an extended object. 

It remains to obtain simple formulae to describe image formation, and this 
is most easily done by considering that typically optical abstraction, the thin 
lens. This is a lens (as in Fig. 2.7) of refractive index n and with curvatures 
(i.e. reciprocal of radius of curvature) c, and c, of the refracting surfaces. We 


oò 


Wavefront 


FIG. 2.7. Formation of a point image. 


neglect the thickness of the lens. This seems paradoxical for the lens in the 
figure, which is biconvex, but the approximation gives useful results. We 
assume that an object point O at a distance / from the lens produces an image 
at O', a distance I’ from the lens, and we want to find I’ in terms of l. 

We have to choose signs for these lengths, and to facilitate this we set up a 
coordinate system with origin at the lens (it does not matter exactly where in 
the lens since its thickness is negligible) and z-axis along the optical axis. Then 
in our figure O' has a positive z-coordinate so that l'is a positive number, and 
O has a negative coordinate, so that lis negative. We can use the same system 
to give signs to the curvatures, since the equation of a sphere passing through 
the origin and with its centre to the right of the origin is of the form 


z=}łe(x? +y?) +., 


with positive c. Thus in our diagram it happens that c, is positive and c2 
negative. We have now settled the problem of signs according to the ordinary 


Geometrical optics 23 


conventions of coordinate geometry, and we can use the symbols as in 
coordinate geometry, i.e. without further concern for signs. 

Ata distance y from the axis the lens will be (c, — c)y* thinner than at the 
centre, by the above formula, i.e. this amount of glass path is replaced by air, 
so that the optical path through the lens between two planes tangent to the 
surfaces will be shorter here than at the centre by 4(n — 1) (c, — c2)y*. The 
depth of curvature of the incoming wavefront is, as in the figure, y*/2/, and 
that of the emergent wavefront is y?/2I', so that to ensure that optical path 
lengths (i.e. times of travel between corresponding points of two wavefronts) 
are equal we must have 

me 


1 TEN ag 
TEA +5 (n — 1) (cy — ¢2)y* = F y? 


or 

eae | 
p T Mie): (2.3) 
This conjugate distance equation relates the positions of object and image 
points, which are said to be conjugates. As we have derived the result, the 
effect of the lens is to add to the incoming wavefront an increment of 
curvature of amount (n — 1) (c, — c2). If the incoming rays are parallel, i.e. 


the incoming wavefronts are plane or the object point is at infinity, the image 
distance I is given by 

1 
h- leae)’ 


as in Fig. 2.8. This distance is called the focal length f, its reciprocal K is the 


power of the lens and the image point is called the second principal focus (any 
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FIG. 2.8. An object at infinity is imaged at a principal focus. 


point where rays meet is a focus). Similarly, if the object point is at a distance | 


given by 
-1 
|=———————= 
(n — 1) (cy — €2) 
allel, as in Fig. 2.9. Thus the point at infinity and a 
d image conjugates. These are two important 
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FIG.2.9. A collimator. The object at a principal focus produces an image at infinity, 
ie. a collimated beam. 


cases for physical optics applications. The second is used, as a collimator, to 
produce a beam of plane waves from a point source; and the first is used to 
show a diffraction or an interference effect which is nominally formed at an 
infinite distance—in the far field—at a convenient place for observation (see 
Section 3.4). 

If the lens forms an image of a small object, say a line segment of height y, as 


in Fig. 2.10, the image will have a height 7’, and the ratio of these is the 
magnification m, 


m = n'hy. (2.4) 


Let « and «' be the semi-angles of the cones of rays forming the image, and let l 
and I’ be the conjugate distances. The ray from the end of the object through 


FIG. 2.10. The image of an extended object. In this figure x and y are positive 
quantities and x’ and yn’ are negative. 


the centre of the lens passes through the lens undeviated, since the lens has its 
surfaces parallel at the centre, and so we have 


nfl=n/l 
or 


m=l'/L (2.5) 
Also, if y is the height at which the other ray shown meets the lens, 
a= —y/l,a = —y/l', 
So that, from eqn (2.5), 
m = aja’. (2.6) 


(2.5) and (2.6) are correct for all signs of the variables, as explained in the 
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caption to Fig. 2.10. In all the above we have been tacitly using the Gaussian 
or paraxial approximation of small angles mentioned in Section 1.6. 

If the object is at the first principal focus, i.e. the focus on the object side, as 
in a collimator, the rays from each point of the object forma parallel pencil, «’ 
is zero, and eqns (2.4) and (2.6) are not applicable. In this case we say that the 
image is formed at infinity and we use its angular subtense as a measure of its 
size. Thus if the object height is 7 (as above) and if the focal length is f, the ray 
from the end of the object through the centre of the lens emerges at an angle 
n/f to the axis, since it is undeviated, and all rays from this point therefore 
make this angle with the axis in the image space. Thus we have the rule that a 
point on the focal plane a distance y from the axis produces plane parallel 
wavefronts travelling at an angle ņ/f to the axis (collimator). Conversely plane 
parallel wavefronts travelling at an angle £ to the axis form a point image at 
the focal plane a distance ff from the axis (objective). 


2.4. Multi-element lenses 


The above ideas can be generalized to lens systems which contain a 
combination of several more-or-less thin lenses, (e.g. the type of system used 
in a camera). From eqns (2.5) and (2.6) the magnification depends on the 
positions of object and image, so it is reasonable to suppose that for a multi- 
element or thick system we can find a pair of conjugate planes with 
magnification unity. In most practical cases these will be virtual conjugates, 
i.e. inside the system as in Fig. 2.11, where they are denoted by P and P’, but 


FIG. 2.11. The principal points of a thick lens, P and P’. 


the principal planes, and the axial 


this is not important. These are known as c 
for a thin lens the principal points 


points are the principal points. Obviously, 


coincide at the lens. ae 
Another ray from an axial object point Oto its image O’ passes through the 


two principal planes at the same distance from the axis, since they are planes 
of unit magnification. Now we can compare Figs. 2.10 and 2.11 for a thin and 
a thick lens. The figures are similarly labelled to indicate their essential 
similarity, but in the thick lens there is a kind of limbo or missing space 
between the two principal planes. However, we can measure l from the first 
principal plane to the object and I from the second principal plane to the 
image, and all will correspond in the two cases. For example, the focal length 
is the distance from the principal plane to the point where rays from infinity 
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focus (as in Fig. 2.12), and the magnification is again given by %/x' = l'/l. 
Corresponding to eqn (2.3), there will be a conjugate distance equation, 
A HS 
-diaaa Ta (2.7) 
{gta E 
but now f, the focal length, depends on the detailed construction of the lens 
system. 


FIG. 2.12. The principal focus F’ and the principal planes for a thick lens. 


2.5. Paraxial raytracing 


The positions of the principal planes of a multi-element optical system can be 
found as in Fig. 2.12; since the planes through P and P’ are planes of unit 
magnification a ray entering the system and meeting the plane through P at a 
certain distance from the axis must emerge from the plane through P’ at the 
same distance from the axis. Then if this ray entered parallel to the axis the 
Point at which the entering and emerging segments meet must be on the plane 
through P'; similarly the plane through P could be found from the path of a 
ray entering parallel to the axis from the right. Thus to find P and P’ we have 
to trace the path of a ray through an optical system in the paraxial 
approximation. 

Paraxial Taytracing is a step-by-step or iterative process in which the path 
of a ray is calculated through each surface in turn. We consider a ray, 
originally from a point on the axis in the object space, and we suppose we 
have found that the ray meets a certain surface at a distance y from the axis 
and that its intersection length is I, as in Fig. 2.13. The refracting surface has 
curvature c and the refractive indices on either side are n and n’ as shown. Let 
the refracted Tay meet the axis at a distance I. Then to find I’ we proceed as in 
Section 2.3, i.e. we find I’ by the condition that the optical path length from the 
(intermediate) object point to its image through the refracting surface shall be 
the same for all rays, i.e. for the path along the axis and for the ray shown. We 
draw in the spherical wavefronts touching the refracting surface and then we 
oe down this condition, after subtracting the radii of the wavefronts, as 

S, 


Bs 2 wä : 
nye — Sny?/l = n'y? — An'y2/l' 
ör ) 2n) 2 y*/ 


g 
F — i = (n' — n)c. (2.8) 
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FIG. 2.13. Notation for paraxial raytracing. 


This is a conjugate distance equation for a single surface and the similarity 
with eqn (2.3), the conjugate distance equation for a thin lens, should be 


noted. 
For numerical raytracing it is more convenient to put this result in terms of 


the convergence angles x and x'; multiplying eqn (2.8) through by y we have 
nwa — na = —(n' — n)cy (2.9) 

Now let the next refracting surface be a distance d along the axis as in 
Fig. 2.14; then from the figure the new incidence height, y+; say, is given by 
Yer =y +d (2.10) 


FIG. 2.14. Transferring from one surface to the next in paraxial raytracing. 


We are now ready to use eqn (2.9) again, since the a’ from eqn (2.9) is the 
incoming convergence angle for this next surface; thus the process is taken 
through the system to obtain 2, and y, for the final, n™ surface. Paraxial 
raytracing is usually done numerically since the algebraic eliminations 
needed to express z, and y, in terms of x, and y, are cumbersome; however, 
some simple cases are given as examples at the end of the chapter. 
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Now returning to Fig. 2.12 we can see that if the incoming ray was parallel 
to the axis, i.e. coming from the axial point at infinity, the focal length must be 
given by —y,/z,; also the distance from surface n to the focus, sometimes 
called the back focal length, is — y,,/x,, and the position of P’ is obtained from 
the difference of these two quantities. Similarly by reversing the process the 
focal length on the object side and the position of P can be found. The two 
focal lengths are numerically equal when, as is usually the case, the object and 
image spaces have the same refractive indices, but not otherwise, as will be 
seen in Section 2.7. 

The paraxial or Gaussian properties of an optical system, i.e. the positions 
of pairs of conjugate points and the corresponding magnification, are known 
once the principal planes and foci are found, since we can then use the 
conjugate distance eqn (2.7), measuring | and | from the appropriate 
principal planes. Thus the optical system can be replaced by a skeleton 
consisting of the principal planes and foci, as in Fig. 2.15, and this can be used 


FIG. 2.15. Graphical construction for finding the image O' Oʻ, of an object O O,- 


in a simple graphical construction for conjugates. Given an object OO, we 
can take a ray parallel to the axis from O, (single arrow) which must travel 
across the space between the principal planes at constant height and then in 
the image space pass through the focus F’. A second ray from O, (double 
arrow) through F emerges parallel to the axis in image space and the 
intersection of these two rays must be O}, the image of O,. 

Figure 2.15 is, of course, a graphical representation of eqn (2.7); it can be 
used to derive another useful form of conjugate distance equation. Recalling 
that distances are taken with signs according to coordinate geometry, the 
image space focal length P’F’ = f” is positive in the figure; similarly the object 
space focal length PF = fis negative as drawn. For complete generality we 
assume these to be numerically unequal. Let F'O' = 2’, FO = z, again with 
appropriate signs, so that in the figure z’ is positive and z is negative. Then by 
similar triangles in the figure it can be shown that 


z/f= —n/n' = —1/m, 
z'If' = —n'/n = —m, 
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so that 
z= ff". (2.11) 
This is Newton’s conjugate distance equation, relating conjugates measured 
from the foci rather than the principal planes. 

The raytracing equations as derived above appear to allow only for lens 
surfaces in the optical system. It is easy to allow for a convex or concave 
mirror by the formal device mentioned in Section 2.2 of putting n' = —n fora 
mirror. We then obtain from eqn (2.8) the following conjugate distance 
equation for a mirror of curvature c, 


1/l + 1/1 = 2c (2.12) 
or, in terms of the convergence angles 
a’ +a=2c (2.13) 


These can easily be verified by a direct calculation as above for eqn (2.8). 


2.6. The Lagrange invariant and the power transmitted by an optical 
system 


We recall, from the Section 2.3, the magnification formula 
m=n'/n = aja, 
where « and a’ are the ray convergence angles and y and y/ are object and 
Image heights. From this we have 
nia’ = ow 

age quantities. We can generalize this 
s in an optical system (as in 
the left is the object for the 
Thus yx is the same in 


as a relationship between object and im: 
to intermediate images formed between lense 
Fig. 2.16), for the image formed by the system to 
Next part, and they share the same convergence angle x. 
all air spaces in the system. 

: Now suppose a plane surface of gl 
immediately after one of these intermediat 
height is unaltered if the image is at the surface, but the con 


ass of refractive index n is placed 
e images, as in Fig. 2.17. The image 
vergence angle 


n 


FIG. 2.16. The Lagrange invariant. 
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FIG. 2.17. An intermediate image formed at a plane surface. 


becomes x’ = x/n in Gaussian approximation. Thus finally-we have that the 
quantity 


H = nan (2.14) 


is an invariant throughout an optical system. In this expression y is the 
intermediate image size corresponding to the original object size, x is the 
intermediate convergence angle of the ray from the original axial object point, 
and n is the refractive index. The quantity H is usually called the Lagrange 
invariant; it was discovered independently by several people including 
Lagrange. It follows that for a system with refractive indices n and n’ in the 


object and image spaces the magnification is given by the statement of the 
Lagrange invariant: 


nan = n'x'y' (2.15) 


As well as relating magnification and convergence angle, the Lagrange 
invariant is a measure of the light flux or power transmission capability of an 
optical system. In a real optical system the convergence angle x on the object 
side is usually determined by the size of an iris diaphragm or aperture stop in 
the system (as in Fig. 2.18), as well as, of course, the distance of the object. 
Thus a small circular object of radius n mm on the axis will radiate into the 
lens inside a cone of half-angle z, and if the light power per unit area and per 
unit solid angle (luminance if it is visible light or radiance for any 
wavelengths) is B W mm~? sr~', say, the power collected by the lens will be 


Aperture 
stop 


FIG. 2.18. 


oita An aperture stop in a system determines the angle x of the accepted cone 
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7° Bn?a? W. If we ignore attenuation by reflection losses, absorption, and 
scattering, by conservation of energy this same power flow must occur across 
any surface of the lens system and across the final image surface. Thus on 
comparing the above expression with eqn (2.14) we see that the power 
transmitted by the optical system is proportional to the square of the 
Lagrange invariant. 

The brightness or luminance of an image formed by an optical system is the 
light power per unit area and per unit solid angle in the image. It is clear from 
the Lagrange invariant that the luminance of the image is equal to that of the 
object multiplied by a factor less than unity which allows for attenuation, so 
that no image can be brighter than the original object if object and image are 
in media of the same refractive index. If the indices are different this statement 
has to be modified in an obvious way since the refractive index appears in the 


Lagrange invariant. 


2.7. The relation between the two focal lengths 
It was stated in Section 2.5 that the object space and image space focal lengths 
are not numerically equal if the refractive indices in the two spaces differ. We 
now use the Lagrange invariant to obtain the relation between the focal 
lengths. Figure 2.19 is a diagram similar to Fig. 2.15 but with the object and 
image at the principal planes, so that the magnification is unity. Then the 
convergence angle x for the object PP, can be seen to be — PP,/ fand that for 
P'P; is P'P}/f' = PP,/f’. Thus eqn (2.15) takes the form 
(PP,)? _ 7 (PP,)? 

hi ri 


so that the required relation between the focal lengths is 


ay 


n 
oy eee (2.16) 
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FIG.2.19. Construction for the relation between the two focal lengths of an optical 


system. 
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2.8. Non-paraxial optics 


The results of Sections 2.3 to 2.7 are all based on Gaussian optics, according 
to which the lens and mirror apertures are supposed to be very small and the 
rays all make small angles with the optical axis. It is possible to define ‘small’ 
more precisely, as a mathematical order of magnitude (Welford 1962), but 
here we shall simply note that experimentally we do find well-defined images 
under Gaussian conditions but if we go beyond a certain range of angles the 
images look poorer (e.g. the image of a lamp formed by a convex spectacle 
lens tipped obliquely, so that the object and image are some way from the lens 
axis, is not sharp). Under such conditions we cannot rely on the simple 
approximate equations of this chapter and we have to trace rays exactly, i.e. 
according to Snell’s law. We then find that point objects do not form point 
images, i.e. there are aberrations. Optical systems such as camera lenses and 
microscopes have many lens components arranged so as to correct or 
compensate aberrations. 


Figure 2.20 shows how one kind of aberration arises when we use a thin 


FIG. 2.20. Oblique refraction by a thin lens. The rays in a section at right-angles to 
the diagram focus at a greater distance from the lens than those drawn. 


convex lens at an angle @ to its axis. We saw in Section 2.3 that a thin lens 
forms an image of a point object by adding an increment of curvature to the 
incident wavefront equal to 1/f, the reciprocal of the focal length; this 
happens because there is more glass at the centre than at the edge, so the 
wavefront is retarded more at the centre. In Fig. 2.20 this effect will still 
happen in the section perpendicular to the plane of the diagram, but in the 
plane of the diagram the width of the lens presented to the wavefront is less by 
g factor cos 0, and since there is, to a sufficient approximation, the same 
variation in thickness between centre and edge, the increment in curvature 
will be greater by a factor 1/cos 0. Thus the refracted wavefront will have a 
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greater curvature in the plane of the diagram than in the perpendicular 
section, and so the rays (normals to the wavefront) will not focus to a single 
point. This aberration is called astigmatism. 

i Another example of an aberration is chromatic aberration. The refractive 
index depends on the wavelength of light, for all material media—an effect 
called dispersion. For example, Fig. 2.21 shows the refractive index as a 
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FIG. 2.21. Dispersion curves of two optical glasses. The names and symbols of the 


glasses have no physical significance. 


function of wavelength for two common optical glasses. The brightest part of 
the visible spectrum lies between the two Fraunhofer spectrum lines Cand F, 
and it can be seen that over this range the refractive index of the crown glass 
varies by about 0.008. Since the focal length of a thin lens (see Section 2.3) is 
given by 1/f = (n — 1) (cĉ — ¢2)s it can be seen that the focal length of a thin 
lens made of this glass would vary by about 0.008/0.52 = 1.6 per cent over this 
wavelength range. This effect is a form of chromatic aberration. Fortunately, 


as can be seen from the figure, the flint glass has about twice this variation of 
refractive index over the same wavelength range, and it is thus possible to 
combine a converging lens of crown glass with a weaker diverging lens of flint 
glass, so as to cancel the chromatic aberrations but yet leave some converging 
power. Such a combination, as in Fig. 2.22, is an achromatic doublet. We can 
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i 
fi 


choose the powers of the components of the doublet as follows. For a single 
thin lens the power K is given by 


FIG, 2.22. An achromatic doublet. 


K = (n— 1)(c, — c3). 


Let the chosen wavelength range correspond to a change òn in refractive 
index. Then we have for the change in K by differentiation 


ôK = dn(c, — c2) 
or 


ôK = —-K (2.17) 


Now let the powers of the two components of the doublet be K, and K;, let 
their refractive indices be n, and n, and let the required total power of the 
doublet be K. From Problem 2.8 we have 


K=K, +k; (2.18) 
and from eqn (2.17) we have for the change with wavelength of the total 
power 
on, ôn, 


K, + K, 
mat a al 


ôK = 


If we require the power to be the same for the two wavelengths at either end of 
the chosen range we set ôK = 0 and we then have 


S ppp Si K,=0 (2.19) 
n =i ai. 


It can be seen that eqns (2.18) and (2.19) are a pair of simultaneous 
equations for K, and K,, giving the required solution for the achromatic 
doublet. There is still a choice of curvatures for the individual thin lenses and 
this is used to correct other aberrations. For a treatment of aberrations in 


general see W. T. Welford, Aberrations of the symmetrical optical system 
(1974). 


2.9. Afocal systems 


In our discussion of optical systems we have assumed tacitly from Section 2.3 
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on that if the object is at infinity the system will form an image at some finite 
distance. Afocal systems form images at infinity of objects at infinity. Passing 
over the trivial examples of plane mirrors and plane-parallel glass plates, 
which are certainly afocal, a non-trivial example is the Galilean telescope or 
opera glass, shown in Fig. 2.23. This system is theoretically adjusted so that 
the entering and emerging beams are both collimated, so that an object at 
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FIG. 2.23. The Galilean telescope, an example of an afocal system. 


infinity is imaged at infinity. Afocal systems fall awkwardly outside the 
formalism so far developed but this difficulty can be overcome by obtaining 
an expression for the Lagrange invariant in a space where the object (image) 
is at infinity. We call the angular subtense of an object at infinity 2, i.e. this is 
the angle which the parallel rays make with the axis as in the figure. Then for | 
very large but finite we can put 3 = n/land x = — y/l, so that from eqn (2.14) 
we have as / tends to infinity, 

= —nyB (2.20) 
now define the magnification as 


in a space where the object is at infinity. We 
n = B'/B,so that for 


the ratio of angular subtenses of the object and the image, m 
n = 1 on both sides of the system 


m= y/y’ (2.21) 


i.e. the magnification is the ratio of the diameters of the entering and emerging 
beams (provided there are no internal apertures to confuse this calculation). 
We cannot define a focal length or power for an afocal system. 


Problems 

2.1 Draw accurately rays refracted from air to glass (n = 1.53) at angles of incidence 
from 0° to 90°, at 10° intervals. : l 

2.2. i are i dia of refractive index n, and nz separated by a plane 

REUSE a ea ea to P, along straight-line segments 


boundary. Calculate the optical length from P, i sur 
from Pio a pont Q on the surface and from Q to P. By differentiation prove 


Sn i A : e Pi . 

23. SR object and image positions, indicating suitable rays, for (a) a 
camera, (b) a slide projector, and (c) a burning glass. 

2.4. Show that the optical path length along all rays between two wavefronts of a 


pencil is constant. 


2.8. 


29. 


2.10. 


2.14. 


. Show that a prism with small angle x devia' 


- A right-angle prism as in Fig. 
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Plot a graph of I’ as a function of l for a lens of focal length 100 mm, allowing the 
conjugates to range between — 1000 mm and +1000 mm. P 
What is the physical significance of positive values of land negative values of l' in 
the above example? 

A thin lens has curvatures c, and c, and refractive index n. Show that the 
analogue of eqn (2.9) for the thin lens is 


2 


a= -yK 
where K is the power of the lens, defined as the reciprocal of the focal length 
(Section 2.3). 


Two thin lenses of powers K , and K , are separated by a distance d. Show that the 
power of the combined system (reciprocal of the focal length) is 


K =K, + K,- dK,K,. 
Hence show that for any number of thin lens 
sum of the individual powers. 
A thin lens has focal length 50 mm. What is the magnification for the following 
object conjugates: 20 mm, — 100 mm, +100 mm. 
What is the focal length of a concave mirror of radius r? Calculate and draw to 


scale the image positions and magnifications for a concave mirror of radius 
100 mm for object distances 25 mm, 100 mm, 200 mm. 


es in contact the total power is the 


- A lamp filament in the form ofa flat ribbon of area 10 mm? radiates 10 W of light. 


(a) What is its luminance, and (b) how much power is collected by a lens of 
diameter 30 mm which is 100 mm from the filament? 

tes light rays through an angle 
(n — 1)x, where n is the refractive index of the prism. 

is made of material of refractive index 1.52. A 
beam of light is reflected at the hypotenuse face at the critical angle of incidence. 
Calculate the angle between the incident and emergent rays outside the prism. 
A Galilean telescope is to be made up of two thin lenses of focal lengths 50 mm 


and —20 mm. Find the separation between the lenses and calculate the 
magnification, 


3. Propagation of waves: interference 
and diffraction 


y I fancy. Specially in these black clothes feel it more. Black conducts, 
it?) the heat. 


Be a warm di 
reflects (refracts i 


James Joyce: Ulysses 


In the geometrical optics approximation we made the tacit assumption that 
rays intersect without interacting with each other. This would mean that rays 
meeting at a point, as at the image of a point source formed by an aberration- 
free lens, produce an infinitely small point image; this is known experiment- 
ally to be untrue. It would also mean that, if two beams of light overlap on a 
screen, the resultant light intensity (power density) is the sum of the intensities 
in the individual beams; experimentally this is sometimes true, sometimes 
false. In this chapter we examine these effects, which are, of course, examples 


of diffraction and interference. 


3.1. Interference of two beams 

We saw in Chapter | that nominally monochromatic light beams have very 
rapid random phase variations. Thus in order to see interference effects 
between two beams we must ensure that these phase variations are the same 
and in step in both beams. This is done by taking both beams from the same 
light source. It is simplest to think first about beams of plane waves 
intersecting at an angle 0. There were many classical experiments in which 
this was done in different ways. Figure 3.1 shows one way In which it might be 
done with modern equipment. If the region where the beams intersect is 
examined, e.g. by putting a white screen there or by scanning a small 
photodetector across it, straight dark and light bands perpendicular to the 
plane of the diagram are found, ie. interference Sringes. Bright fringes are 
formed whenever the two waves are in phase. The inset to Fig. 3.1 shows 
wavefronts from the two beams at a given instant and from this it can be seen 
that the spacing o between the fringes is given by 


o = Ìjsin 0, (3.1) 
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FIG. 3.1. A convenient way to form interference fringes. Two beams of collimated 
light from the laser intersect at an angle 0, and fringes are formed where the beams 


cross. The beam-splitter is a plate of glass with a thin semi-transparent film of 
aluminium on one surface. 


since it corresponds to the intersection of one wavefront of beam 1 by 
successive wavefronts of beam 2. 


In slightly more detail, if we suppose the beams are of equal intensity we can 
represent their complex amplitudes by (Section 1.3), 
beam 1: E exp {—2niz//}, (3.2) 

beam 2: E exp {—2zi(z cos 0 + y sin 0)/A}. 


The total complex amplitude in the interference pattern across the plane 
2 = 0, which we can take to be the plane of observation, is then 


E{1 + exp (—2zi(y//) sin 0)!, 


and the observed intensity is the squared modulus of this, i.e. 


2 2h 
ty) = 26°41 + c0s(7 sin o)} 


(3.3) 
= 4E? cos?( y sin o). 
2 ) 
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This function, giving what are usually called cos? fringes or two-beam fringes, 
is plotted in Fig. 3.2. Since each time the argument of the cosine increases by z 
we go through a complete period of the fringes, we have verified eqn (3.1) 
above for the fringe spacing. The cos? light-intensity distribution can be 
verified by photo-electric scanning, but visually the fringes appear to have 
much narrower dark regions than in the figure. This is a consequence of the 


y 


FIG. 3.2. The light-intensity distribution across cos? fringes. 


very nonlinear response of the eye (Section 1.4), and it is often misleading in 
making a quick visual assessment of an interference or diffraction effect. 
If the beams are of different amplitudes, and therefore of different 
intensities, the minima in the fringe system will not be zeros, i.e. there will not 
be maximum contrast or visibility in the fringes (see Problem 3.3). 
Generally, we are mainly interested in the fringe spacing and contrast 
rather than in the details of the intensity variation in the fringes. The maxima 
occur where the two beams are in phase. We can generalize this immediately 
by noting, from Chapter 2, that points ofequal phase occur where the optical 
path lengths from the source via the two interfering beams to the point in 
question are the same, or where they differ by a whole number of vacuum 
wavelengths. This is then applicable to interference in media of different 
refractive index. A good example is the oil film on water or, more generally, a 
layer ofindex n and thickness d, as in Fig. 3.3. Let a collimated (parallel) beam 
of wavelength 4 meet the layer at an angle of incidence J. Some light is 
reflected at each surface, and there will be a path difference between 
corresponding wavefronts. This is indicated in the figure where £, and Z, 
have originated from the same wavefront after a certain time. It is a simple 
exercise in the use of Snell's law (Chapter 2) to show that the optical path 


difference between the two beams is 
2nd cos I’, (3.4) 


where I’ is the angle of incidence inside the layer. Thus we should expect a 


bright fringe to be formed in the film whenever 
2nd cos I' = NA (3.5) 
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E 2nd cos I’ 


FIG. 3.3. The difference of optical paths between beams reflected at the first and 
second surfaces of a layer of thickness d and refractive index n. 


where N is an integer, the order of interference. In fact under certain 
conditions there is a change of phase of z of one or both of the beams on 
reflection and so the right-hand side of eqn (3.5) would be written (N + 4)Aif 
the phase change occurred at one but not the other surface. The phase change 
on reflection is obtained from electromagnetic theory (see e.g. F. N. H. 
Robinson, Electromagnetism, OPS 1); here it is sufficient to note that for 
angles of incidence below the Brewster angle (Section 4.3) or for most 
practical purposes below 45° a phase change of z occurs when the light is 
incident on the interface from the medium of lower refractive index, ¢.g- 
reflection at a water surface in air, but not when the light is incident from the 
medium of higher index. Thus if the film were of oil of refractive index 1.45 
floating on water (n = 1.33) there would be a phase change at the air-oil 
interface but not at the oil-water interface and (N + })A should be used in eqn 
(3.5). Again, if Newton’s rings are formed by placing a convex lens surface on 
a plane surface (Fig. 3.4) the value of d is zero at the point of contact, the 
centre of the fringe pattern, and the centre will be dark. 

Equation (3.5) shows how variations in the thickness d of an oil film are 
indicated by the shape of the interference fringes. Also in Newton's rings 
(Fig. 3.4) the successive fringe diameters indicate the variation of the 
thickness of the gap between the two glass surfaces. This equation also has 
other applications, as will be seen in the next section and in Chapter 6. 


3.2. Interference with extended and polychromatic light sources 


In discussing interference between the beams reflected from an approximately 
Parallel layer in the previous section we assumed the incident light was 
monochromatic and collimated over a reasonable area of the layer, and this 
implied that it came from a single point source at a great distance. However, 
we normally see such effects under less stringent conditions: the source may 
Cover a large extent, e.g. the sky for oil films or a sodium lamp for Newton’s 
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FIG.3.4. Newton's rings formed by interference between beams reflected at a curved 
interface. (a) The apparatus. (b) The spacing to scale of successive bright fringes. If the 
surfaces are in contact the central fringe is dark. 


rings, and it may have a range of different wavelengths in it. We can 
understand this by referring again to eqn (3.5). 

First, consider an extended source. Each source point will form its own 
interference fringes, and these fringe systems will be independent, i.e. the 
intensities will add. The angle of incidence 7’ will be different for different 
source points, so that the path difference (eqn (3.4)) at a given part of the layer 
will vary, and thus the fringe maxima will not coincide. However, if the 
thickness d is small enough quite a large variation in the angle of incidence is 
needed to change the path difference by, say, 4/4, so that the fringe systems 
will all more-or-less coincide, and fringes can be seen with an extended source. 
Problem 3.5 illustrates this. In Chapter 1 we spoke of the need to have 
coherence between light beams if they are to show interference effects. In the 
present case we see that the source has to be restricted in size (more exactly in 
angular subtense) to ensure that the beams reflected from the two surfaces of 
the layer are coherent. 

If we now put eqn (3.5) in the form 


2nd cos I’ _ 
À 


N, (3.5a) 


we see that for given thickness and angle of incidence the order of interference 
N will vary with the wavelength. If N is non-integral it is interpreted as the 
number of wavelengths, possibly fractional, of path difference between the 
interfering beams. Thus if the source is polychromatic the fringe systems from 
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the various wavelengths will again be displaced and will add in intensity to 
give a more-or-less uniform appearance, as in Fig. 3.5. However, again if the 
layer is thin enough, i.e. d is small enough, N in eqn (3.5a) will vary very little 
over a reasonable wavelength range, and fringes of good contrast will be 
obtained. Thus for coherence we have to restrict the wavelength range. 
Two-beam interference effects are used in many devices. A few of these 
devices are discussed in Chapter 6, and others are described in classical texts 
(e.g. Ditchburn 1952; Longhurst 1973). From the above we can make a 
generalization which applies with appropriate modifications to all of these. 


Two-beam interference effects can be obtained with polychromatic 
extended light sources. The contrast or visibility of the fringes depends on 
both the bandwidth (frequency or wavelength spread) and the angular 
extent of the source. Generally both of these must decrease with increasing 


optical path difference between the beams if the visibility or contrast is to be 
kept high. 


In Chapter 6 we shall see applications of this principle to astronomy and 
spectroscopy. 


FIG. 3.5. Superimposition of intensities of fringes formed in polychromatic light. 


3.3. Diffraction 


In the previous section we started by discussing interference between 
collimated beams, as in Fig. 3.1. The beams were regarded as composed of 
plane waves of indefinitely great width—ice. simply as described by, say, eqn 
(1.5)—with no restriction placed on the Position vector which indicates the 
point In space at which we consider the wave disturbance. In fact the beams 
are limited in extent by the diameters of the collimator lenses in Fig. 3.1, and 
these certainly cannot be considered as indefinitely large. This restriction 
does not materially affect the description of two-beam interference; however 
when we examine the propagation of a single collimated beam from a point 
Source we find that it does not propagate indefinitely with a sharply defined 
rim given by the edge of the lens; instead, the disturbance spreads out and 
becomes uneven near the edge in a complicated way. This is indicated in 
Fig. 3.6, which shows the light intensity observed in line with the edge at 
different distances. We now discuss this effect, known as diffraction. 

We can simplify the discussion by considering first a collimated beam of 
rge diameter which meets an Opaque straight edge: experimentally the effect 
very much as in Fig. 3.6, i.e. it does not much matter if the edge is curved or 


la 
is 
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FIG. 3.6. Diffraction at an edge. The graphs show the light intensity near the 
geometrical shadow at two distances from the edge in the ratio 1:4. 


straight. Similar effects are found in the propagation of all waves; in sound 
and water waves it is easy to observe diffraction at an edge or any obstacle 
because the wavelength is large. Qualitatively we can see why diffraction 
occurs (i.e. why there is not a sharp shadow at all distances from the edge in 
Fig. 3.6) in terms of a basic physical principle that discontinuities do not 
occur in the wave representation. Thus the wave disturbance cannot stop 
abruptly at the line of the geometrical shadow, but must decay gradually. 
However, this does not give us a quantitative picture. 

It can be shown (Chapter 10 of Electromagnetism, OPS 1) that in a uniform 
medium the electric and magnetic fields of electromagnetic radiation both 
obey a partial differential equation—the equation of wave motion. Thus for 
one component of the electric field vector, E, say, we have, in a uniform 
dielectric medium 


V?E, = H,ho€,ĉ0 Ëx (3.6) 


where jug and £ọ are the permeability and permittivity of vacuum and 4, and £, 
are the relative permeability and the relative permittivity of the medium. For 
monochromatic (single-frequency)waves we can put E, = E exp iwt in eqn 
(3.6), where E now becomes a complex amplitude (Chapter 1), and we obtain 


VE + W H,Ho££0E = 0, (3.7) 


the time-independent equation of wave motion. 

To solve the diffraction problem in complete generality for electromagnetic 
waves we should have to solve eqn (3.7) and five others like it for all 
components of E and H, using appropriate distributions of u, and e, and 
putting in suitable boundary conditions at the diffracting obstacles. This has 
been done for some simple cases, and the results have been verified 
experimentally by measurements with microwaves (À ~ 10 mm). In the 
optical region we can greatly simplfy the problem by considering just one 
component of E (the scalar wave theory) and by using some approximations 
which are good for most regions of practical interest. Roughly, these regions 
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are those at a large distance from the diffracting structures, where ‘large’ 
means many wavelengths and where the diffracting angles are small, i.e. less 
than, say, 0-1 rad. Figure 3.7 illustrates these regions for diffraction of a wave 
at an aperture in a screen. 

We then have the following physical picture. In wave propagation, as the 
disturbance progresses through the medium, each point reached by the 
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FIG. 3.7. The region in which scalar diffraction theory can be safely applied. 


g point for the disturbance which moves 
Pictured in the case of a transverse 
hed string or for water waves from an 
at an aperture in a screen we proceed as 
eft to right and meets a plane opaque 


1 
ita Huygens’ Secondary wavelets. Two secondary sources are indicated in the 


) 
) 


Propagation of waves: interference and diffraction 45 
These ideas were roughly formulated by Huygens in the seventeenth 
century, refined by Fresnel early in the nineteenth century, and given a 
definite mathematical form by Kirchhoff about 80 years later. Huygens and 
Fresnel put their ideas in the form of the physical picture we have sketched 
but Kirchhoff obtained his result as a solution of eqn (3.7), with certain 
simplifying assumptions which restrict the applicability as outlined above. 
We give here a special case which is simpler than Kirchhoffs formulation but 
which applies to many problems of current interest. 
We take a rectangular coordinate system as in Fig. 3.9 with the x, y-plane 


Normal 
to screen 


FIG.3.9. The diffraction integral. The distances č and y as shown are both negative. 


as the plane of the screen, so that the arriving wavefronts are parallel to this 
plane, and we wish to determine the complex amplitude at a point P with 
coordinates (č, n, Č). According to the above ideas, we assume that an element 
dx dy of the wavefront in the aperture acts as a source of a secondary 
spherical wavelet of strength proportional to the area of the element. Thus the 
complex amplitude at P due to this element is 

Eo, Ean U IKR ae dy, 

A R 4 


where k = 27/4 as in Section 1.3. Eo is a constant complex amplitude. The 
factor Eo// occurs naturally in the development according to Kirchhoff. Here 
we shall accept the factor as a way of keeping the equations dimensionally 
correct. The total effect at P due to all the incident wave which passes through 
the aperture is then 


Eo {fl & 
Ep = i f | Rowe (—ikR) dx dy. (3.8) 
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The integral is to be taken over the whole of the aperture and R is, of course, a 
function of x and y as different elements of the wavefront are taken. 
To express R in a manageable way, we have, from Pythagoras’ theorem, 


R= (xP + y-n +2 


=x? + y? — 2(ëx + ny) + č +n? + C. 


Thus 
2(ëx + ny) x? +y? ë+ £y 
R=Q1 = + 4 2 
EER 
Spe S CI Cea? 


+ Arei (3.9) 
t 2¢ 2¢ 


on expanding as far as the first term by the binomial theorem. As we should 
expect, R consists of a term č + (č? + 92)/C which is independent of x and y 
and which is large in terms of 4, together with some smaller terms. Whenever 
R increases or decreases by one wavelength (as a Consequence of varying x 
and y in the integration) the exponent in eqn (3.8) changes: by 27, and the 
complex amplitude in the integrand goes through a complete cycle. Thus 
small changes in R are important in the exponential. On the other hand, since 
we are supposing R is large compared with 2, such changes can be ignored in 
the 1/R factor, and this can be written 1/C and taken outside the integral. Since 
We are assuming small diffraction angles we need only consider linear terms in 
č/¢ and /¢ in eqn (3.9). Eqn (3.8) then becomes 


— Fo TA ik ane aaa 
Bo 7g 0 (ik) » ffep FF es + my) — H e +y h dxdy (3.10) 


We now make an important simplifying assumption, that Cis so large that 


can be neglected. This means that the maximum value of 
ere in the aperture is much less the 


E i27 
Eps Sa [feof (ex + wi} dx dy. 
Tan 2 ) 


We can apply this immediately to the simple but useful case of a square 
aperture of side a, as in Fig. 3.10. We take the origin at the centre of the 
square, and we let ¢ be the 2-coordinate of the Plane in which we want to find 
the diffraction pattern. The double integration in eqn (3.11) splits into two 
factors, 


(3.11) 
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FIG. 3.10. Far-field diffraction at a square aperture. 


The notation sinc x is used for (sin zx)/zx, so that the diffracted complex 
amplitude at (č, n) is 


E(é,) = Bo sinc ($) sinc (2). (3.12) 
4 


À¢ À% 


The light intensity I(č, n) is the squared modulus of the complex amplitude 
(Section 1.3), so we have 
aE, . (=) W (A 
ën) = sinc? | = 2(—). (3.13 
1(é,n) zoz sinc ic sinc i ) 


RE AC 


The general form of this pattern, a central maximum of intensity with 
surrounding subsidiary maxima and minima, is indicated in Fig. 3.10 by the 
hatching (see also Problem 3.7). In describing such patterns we usually 
rescale the intensity by normalizing it to unity at the centre of the pattern, i.e. 
at € = 9 = 0. For eqn (3.13) this simply means omitting the factor at E2/270?; 
this is acceptable for most problems in physical optics, but the physical 
dimensions are lost. The factor a* EG/2 2 indicates that the central intensity is 
Proportional to the fourth power of the linear dimensions of the aperture and 
inversely proportional to the square of the wavelength. These are general 
Tules applying to all diffraction of this kind, where quadratic terms in the 
aperture are negligible. From the argument of the sinc function in eqn (3.13) 
the lateral scale, i.e. distances between successive maxima, varies inversely as 
the aperture size and directly as the wavelength, and these are again general 
rules. 


Figure 3.11 shows this diffraction ] 
contours of constant intensity in the pattern, sometimes called isophots, and 


pattern quantitatively. The lines are 
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FIG. 3.11. Contours of constant 
aperture, in the far field; two of the si 
the other six neares 


intensity in the diffraction pattern from a square 
de lobes nearest the central maximum are shown; 
t side lobes are of the same shapes as these two, by symmetry. 


the intensity scale is normalized to unity at the centre. The lateral scale can be 
obtained from the lines of zero intensity, since these correspond to values of z. 
2n, 37, ... in the argument of the sinc functions in eqn (3.13). 
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3.4. Diffraction in the far field 


Suppose that in Fig. 3.9 we observe the diffracting aperture from the point P 
being found and Suppose also that we could see 
the variations in complex ampli i 


he aperture due to the variation in 
done by, for example, a suitably arranged i 
made that the term in x? + y 


so large that this phase variation over 
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than quadratic, since it is in fact the exponent in eqn (3.10). The distance ¢ 
from the diffracting aperture then satisfies 

C>a/i, (3.14) 


and we have far-field diffraction or Fraunhofer diffraction. If on the other 
hand the quadratic term is not negligible then 


o<a7/A, (3.15) 


and we have near-field or Fresnel diffraction. 

We shall consider first far-field diffraction. In the limit as ¢ gets very large 
we can put č/č = u, n/¢ = v, where u and v are angular coordinates, and we 
speak of light diffracted into a direction (u, v). Eqn (3.13) is then written in 


normalized form 
I(u, v) = sinc? (5) sinc? (=) (3.13a) 


We do not actually have to go to a distance given by eqn (3.14) to observe 
far-field diffraction. From Section 2.3 parallel rays in a direction with 
components (u, v) on one side of a lens come to a focus at a point with 
coordinates (fu, fv) on the focal plane of the lens. Thus an objective 
(collimator in reverse) can be used to bring the far field to a convenient 


distance, as in Fig. 3.12. 


Diffracting 
aperture ae 


eld to a convenient place for observation. The far-field 
s formed at the focal plane of the lens. (Ifin addition 
between different parts of the far-field pattern to 
f the aperture we must also put the aperture 


FIG. 3.12. Bringing the far fi 
diffraction pattern of the aperture i 
we want the phase relationships 
represent correctly the Fourier transform o! 
at the front focal plane of the lens.) 


We can re-write eqn (3.11) normalized to have unity intensity at the centre 
of the far-field, i.e. in the direction of the incident wave, given by č = 7 = 0, 


and we can also write the far-field coordinates as the diffraction angles (u, v). 


We then have for the normalized complex amplitude at (u, v), 


ns 2 
: | | exp fr (ux + w} dx dy, (3.11a) 


E(u, v) = 3 
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where A is the area of the diffracting aperture and, as before, the integration is 
over the aperture. Next we specify the diffracting aperture by means of a 
function F(x, y), which is put as a factor inside the integral and which is 
defined as equal to unity for (x, y) inside the pupil and zero outside. This 


enables us to formally extend the limits of integration to infinity and eqn 
(3.11a) becomes 


E(u, v) -+ Mi F(x, y) exp {= (ux + wh dx dy. (3.1 1b) 


The introduction of the function F(x, y) is more than a mere formal device. It 
need not be only a binary function (i.e. taking only values 0 or 1); it can be 
modified to give the effect of a screen across the aperture which absorbs some 
light or which has a phase-changing effect. Both these devices are useful; if 
there is absorption the normalizing factor strictly has to be interpreted as 
(JF (x, y) dx dy)7?, 

Equation (3.11a) can be interpreted as a Fourier transform rel 


ationship 
(see Appendix). We see that if eqn (3.11b) is rewritten in the for 


m 
; rr F(x,y ; 
S(s, t) = | f Pa exp Jionisx + w} dx dy (3.11c) 
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the far-field diffraction pattern of a regular lattice of diffracting points, i.e. it is 
the Fourier transform of a periodic array of delta functions. 

The far-field pattern for a circular aperture must be radially symmetrical 
about the axis of the aperture. Taking only one angular coordinate u, which is 
the (small) angle between the axis and the direction in the far-field in which we 
are interested, it is found that the complex amplitude in the far-field is in 
normalized form 


fe 2J , (2nau/A) 


16 
2nau/i (3:16) 


E(u) 
where J,(z) is the Bessel function of the first kind and first order. This 
function, which is available from many books of tables, behaves like an 
attenuated sine wave.t The intensity in the diffraction pattern is plotted in 
Fig. 3.13 with logarithmic ordinate scale. It is known as the Airy diffraction 
pattern, after the astronomer G. B. Aity, who calculated it as the theoretical 
form of a star image. 

We can observe far-field diffraction effects in the optical region most easily 
with a laser as light source. It is necessary that all parts of the wavefront in the 
diffracting aperture should be able to interfere with each other, i.e. they must 
be coherent with each other in the sense explained in Section 1.5. The light 
from a helium-neon laser in correct adjustment behaves as if it came from a 
single point source, and it is therefore coherent over the whole wavefront. 
Figure 3.14 shows a typical arrangement of apparatus for producing far-field 
diffraction patterns; diffracting screens of different shapes are placed in the 
collimated beam. Many beautiful examples of far-field diffraction patterns are 
given elsewhere (e.g. Lipson 1972). 

It is often useful to estimate the general features of a far-field diffraction 
pattern without a detailed calculation. It is usually true that if the diffracting 
aperture is simple in shape and has no blanked-off area in the middle then the 
maximum intensity in the far-field is in the direction of the incident wavefront. 
If we go away from this direction by an angle 2/d, where d is a distance of the 
order of magnitude of the width of the aperture, this will be roughly the 
direction of the first minimum. Thus the angular half-width of the central 
maximum, i.e. the full width of the pattern at half maximum intensity, is of 
order i/d. For example, a laser beam about | mm in diameter (i.e. as it comes 
from the laser) will spread by diffraction, even ifnominally collimated, over an 
angle of about 1 mrad, but if the beam is first spread out by means of a beam- 
expander (as in Fig. 3.15) to, say, 20 mm, it will only spread at about 


0.05 mrad. 
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FIG. 3.13. The Airy pattern, or light-intensity distributio 
source of monochromatic light formed by as 
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ystem with a circular aperture. The 
dulus of the amplitude, as in eqn (3.16). 


3.5. Diffraction in the near field 
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FIG. 3.14. A diffractometer. The far-field pattern of the screen is formed at the focal 
plane of the objective. If the complex amplitude in the far-field pattern is to have the 
correct phase distribution according to eqn (3.1 1c) the screen should be at a distance f 
from the objective, but this is not necessary if only the intensity is to be observed. 


FIG. 3.15. A beam-expander. This is an afocal system, i.e. it forms an image at 
infinity of an object at infinity. The incoming beam is expanded in the ratio of the focal 


lengths of the two lenses. 


the complex amplitude at a distance ¢ from the plane of the slit and we take a 
coordinate č as in Fig. 3.16. Then from eqn (3.10) we have to evaluate 


fa 


Ep = [exp fe - | dx 


ae 
ing ia -in 
=ex [exp = (x-€ 4, dx. (3.17) 
a 


We can drop the factor outside the integral since it will give unity on taking 
the squared modulus to get the intensity, and eqn (3.17) can then be expressed 
in terms of Fresnel integrals; these are defined as follows 


ce) = f cos 5 Pde 
o 


fn E (3.18) 
S(z) =| sin5¢ de. 


0 
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FIG. 3.16. Near-field diffraction from a slit. 


After some simple changes of variable (eqn (3.17) gives 
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where again the positive sign is to be taken if P is 
shadow and the negative if it is on the dark side. 
Figure 3.17 shows the light intensit 
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get the diffraction pattern across the shadow 
ge, by putting č — ła = h, say, so that h is the 
y, and letting a tend to infinity. From tables of the 
S(%) = 0.5, so that the complex amplitude across 


on the bright side of the 


Propagation of waves: interference and diffraction 55 


[a ae 2 4 6 


FIG. 3.17. The near-field diffraction pattern from a straight-edge. The ordinate is 
the light intensity, given by the squared modulus of the expression (3.20); the abscissa z 
is the argument ,/(2/2¢)|h] of the Fresnel integrals. The geometrical optics shadow edge 


is at z = 0. 


diffraction pattern from a single edge it can be seen that no matter how large ¢ 
is taken the terms in x? + y? cannot be neglected in the diffraction integral, i.e. 
it is a near-field pattern at any distance. Thus in observing the occultation of 
a star by the Moon or by a planet the shadow sweeping over the surface of the 
earth has a light intensity distribution similar to Fig. 3.17 to a suitable scale, 
but with due allowance for the spread of wavelengths in the light of the star. 


3.6. Interference, diffraction, and the photon picture 


Interference and diffraction are now thought of as essentially wave pheno- 
mena, although there have been attempts (e.g. by Newton) to explain 
interference using a particle model. Yet the dual nature of light—wave and 
particulate—is well established, and the apparent contradiction at an 
elementary level between these aspects is perhaps more striking than for any 
other particle-cum-wave. The contradiction is sometimes handled as in 
Section 1.6 simply by saying that we have to use the wave representation for 
some purposes and the particle representation for others, but it is possible to 


enlarge on this approach. 
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Suppose we have an interferometer, such as in Young’s experiment 
(Section 6.1), and we wish to discuss its properties in terms of photons rather 
than waves. Then according to the methods of quantum mechanics we have 
to consider the passage of a single photon at a time through the apparatus. 
Indeed, with an ordinary light source and an apparatus of reasonable size, it 
can easily be shown that it is unlikely that more than a single photon will be in 
transit through the apparatus at any given time. Thus we imagine a detector, 
in the plane where the fringes are to be formed, which builds up the fringe 
pattern as dots, one for each arriving photon in the Position where it activates 
the detector. This experiment was first done by G. I. Taylor using photo- 
graphic plates, and it has since been repeated in many different ways. The 
results always show that the photons appear at first to be arriving at random 
positions, but gradually, as more photons arrive, the classical interference 
pattern as predicted by wave theory is built up. 

We explain this by saying that the Photon does not follow a definite path 
through the apparatus, but that it can follow any of several different paths. 
Clearly there should be a high probability for the Photon to follow a path 
terminating near the position of an interference fringe maximum, and a low 


probability for it to arrive near a minimum. To calculate these probabili 
we should have to solve Schrédin: 
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corresponds to the classical condition, since momentum includes frequency 
and direction. 

The above explanation is only a sketchy attempt to explain a topic which is 
in detail very complicated. However, much the same argument applies to 
interference and diffraction of all elementary particles. From a particle 
viewpoint we are dealing with varying probabilities of arrival at different 
points, and the wave picture provides, in effect, a convenient way of 
calculating these probabilities. Then, after calculating the probabilities by 
using wave theory, we can call them intensities if we are dealing with a large 
enough flux of particles. 


Problems 


3.1. Two beams of radio waves of frequency 3 MHzintersect at an angle of 10°. What 
is the interference-fringe spacing? 

3.2. How many fringes are formed per millimetre if light beams of wavelength 
632.8 nm intersect at 5°? 

3.3. Two beams interfere at an angle 0. If the complex amplitudes are in the ratio 2:1, 
show that the intensity in the fringe system has the form 


2n 
efs + 4 cos E ysin o)}. 
4 
and plot this function. 


3.4. Two glass plates are nearly in contact and make a small angle 0 with each other. 
Show that the fringes produced by interference in the air film have a spacing equal 
to 2/20 if the light is incident normally. 

3.5. A monochromatic source of wavelength 546 mm is 25 mm in diameter and is 
placed 500 mm above an air film between two glass plates. Show that the air film 
can be about 0.2 mm thick before the fringes begin to lose visibility. (Hint. The 
range of angles of incidence is from 0° to 0 = arctan 12.5/500; the range of path 
differences is from 2d to 2d cos 0, and this should be less than 7/4.) 

3.6. An air film is 100 pm thick and fringes are to be formed in it from a polychromatic 
source of mean wavelength 550 nm. Approximately what wavelength range can 
be used? (Choose a range from A, to 2, such that N does not vary by more than 
1/4. 

kyi o the functions sinc x and sinc? x for values of x up the the third zero. Tabulate 
the values of the subsidiary maxima. 

3.8. An aperture 5 mm in diameter diffracts light of wavelength 0.5 um. How far away 
must a screen be placed to show the far-field diffraction pattern? 

3.9. A teaching laboratory has a 2 m long optical bench. Suggest a suitable aperture 
size for demonstrating far-field diffraction with a helium-neon laser. 

3.10. Plot the amplitude and intensity distribution in the Airy pattern. Find by 
numerical or graphical approximation the intensity in the first bright ring, the 
radius of the first dark ring, and the radius at which the intensity is half the central 
maximum. 

3.11. A beam from a ruby laser (694 nm wavelength) is to be used in measuring 
variations in the distance of the moon by timing its return from mirror systems 
arranged on the moon. If the beam is expanded to 1 m diameter and collimated, 
estimate its size at the moon. (Moon's distance ~ 3.8 x 105 km.) 


4. Polarization 


4.1. Everyday aspects 


Most of us have observed polarized light, through ‘Polaroid’ sunglasses. 
What we are seeing arises because light can have asymmetry about the 
direction of propagation; the appearances change when the glasses are 
rotated. Thus in the wave representation of light the disturbance cannot be 
along the direction of travel, as it is in sound waves. Simil 
aerials also have directionality, 
tromagnetic waves in which the d 
of propagation. 
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4.2. Kinds of polarized light 
To find out what a polarizer does we must first describe polarized light. 
Suppose we have a beam of collimated light travelling in the z-direction, as in 


Fig. 4.1. To polarize it we put a polarizer in the beam with its polarizing 
direction parallel to the y-axis,t and we then have the beam polarized with the 


y 


Polarizer 


FIG. 4.1. The light transmitted by two polarizers at an angle 0. 


electric vector in the y-direction. If we follow this with a second polarizer with 
its axis at right-angles to that of the first then no light is transmitted. If the 
second polarizer has its axis at some other angle 0 we resolve the incident 
electric field E, into components E, cos 0 and E, sin 0 parallel and per- 
pendicular to the new direction, and only E, cos 0 is transmitted. Thus we 
should expect the transmitted light intensity to vary as cos?0. Experimentally 


this is found to be so, and this is confirmation of our model of a polarizer and 


of polarized light. 
The light produced by a polarizer as described above is said to be plane- 


polarized, because the electric vector remains parallel to one plane—that 
which contains the direction of propagation and the polarizing direction of 
the polarizer. We can propose other kinds of polarization, as follows. Light 
plane-polarized in the y-direction has an electric field of the form 


E,(t, r) =(0, Ey, 0), (4.1) 
otation (A,, A,, Ax) denotes the three 


ations are merely special forms of eqn 
is taken. We can now suppose added 


where E, = E, cos (wt — kz) and the n 
components of the vector A. These equ 
(1.5) in which the real component only 


n, we use the polarizer as in sunglasses, i.e. we look 
h shiny surface and turn it so that the reflection from 
irection of the polarizer is then vertical, as will 


+ To find the polarizing directio 
through it at a horizontal smoot 
the surface is minimized. The polarizing di 
be seen in Section 4.3. 
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to this a coherent beam travelling in the same direction but polarized at right- 
angles and with different phase and amplitude, 


E,(t,r) = (E,, 0,0), 
(4.2) 
E, = E, cos (wt — kz + €) 


(An experiment in which such an addition occurs is described in Section 4.4.) 
To examine the result we take z = 0 and we suppose first that there is no 
phase difference, i.e. £ = 0. Then at all times the resultant electric field makes a 
constant angle arctan (E,/E,) with the x-axis, and so we have plane-polarized 
light, but with the plane of polarization in the direction arctan (E,/E,). 


However if there is a phase difference, the direction of the resultant field will 
change with time; e.g. if ¢ = n/2 we have 


E; = —E, sinot, (4.3) 
E, = E, cos wt, 


and so the tip of a vector representi 


é ng the electric field traces out an ellipse (as 
in Fig. 4.2) with angular frequency 


w. We then have elliptically polarized light. 
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FIG. 4.2. The electric 
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chosen axes, is constant in time. Unpolarized light, sometimes called ‘natural 
light’, can now be described in a more general way than in Section 1.5, as light 
in which the state of polarization, in general elliptic, changes rapidly and 
randomly in time. We can also have light beams which are mixtures of 
unpolarized and polarized (plane, elliptic, circular) light. These are said to be 
partially polarized. Most light from everyday sources is partially polarized, 
e.g. sky light, sunlight, light from metal filament lamps, and light reflected 
from smooth surfaces, but sometimes the polarized component is a small 
proportion of the total intensity. 

In describing elliptic polarization a sign convention is necessary for the 
direction of rotation. The convention is that the rotation is clockwise looking 
towards the source for right-hand elliptic or circular polarization. 


4.3. Production of polarized light 


The reflection factor, or ratio of reflected to incident light intensity, for a 
smooth interface between transparent media of different refractive indices can 
be calculated for electromagnetic waves (see eg. Chapter 10 of 
Electromagnetism, OPS 1). Let the light have a angle of incidence 0, from a 
medium of index nj, as in Fig. 4.3. If the incident light is plane-polarized with 
the electric vector parallel to the plane of incidence, as in the figure, the 
reflection factor for light intensity is 

n,/cos 0 — n,/cos ny (44) 

P = \m/cos 02 + n,/cos 0, / ° 

and for the other polarization, with electric vector perpendicular to the plane 
of incidence, the reflection factor is 


(2 cos 0; — n, cos Ay (435) 


ny cos 0, + n, cos 0, 
\ NN 


Normal 


Refracting 
interface 


FIG. 4.3. Reflection and refraction of p-polarized light at a dielectric interface. 
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The subscripts p and s, which are almost universally used, stand for parallel 

and senkrecht (German for parallel and perpendicular). : E 
The ratios R, and R, are plotted in Fig. 4.4 for reflection ata a 

interface, i.e. taking n, = 1,n, = 1.5. It can be seen that R,is zero atan angle 

of incidence of about 57°, so that at this angle the reflected light will be 

completely plane-polarized perpendicular to the plane of incidence. It is easily 
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FIG. 4.4. Reflection factor of a glass surface for p- and s- 
of angle of incidence. 
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more advanced texts (e.g. Ditchburn 1976), where it is shown that for a given 
direction in a crystal two distinct plane waves can in general be propagated; 
they have different speeds and are polarized at right angles to each other. Also 
a plane wave refracted into such a crystal in general separates into two waves 
with different velocities and again orthogonally polarized. Another way to 
view this is in terms of propagation from a point source inside the crystal. For 
an isotropic medium the speed would be the same in all directions and the 
wavefronts would be spherical. For a crystal the surface which the disturb- 
ance would reach in a given time is called the ray surface (perhaps a rather 
confusing name in the present context) and it is found to be a double surface 
or surface of two sheets, corresponding to two oppositely polarized 
wavefronts travelling at different velocities. 

Consider, for example, calcite (Iceland spar), which is crystalline CaCO3. 
We suppose that by some means it is possible to produce a point source of 
light inside the crystal, as in Fig. 4.5. There are two parts to the propagating 


Unpolarized 


Optic-axis 
direction 


o ray 

Retardation SE o wavefront 
e wavefront 

FIG. 4.5. Double refraction or birefringence in a crystal of calcite (CaCOs). The 

angle between the o and e rays is exaggerated. The circles indicate that the electric field 


is perpendicular to the plane of the diagram. 


disturbance, one a spherical wavefront (the o disturbance, wavefront, or 
beam) and the other an oblate ellipsoid of revolution (the e disturbance). The 
y accordingly to Huygens’ principle, and 
they are polarized at right-angles as indicated. Thus if a collimated beam is 
incident normally on the crystal, the o disturbance will pass through 
undeviated, but if we propagate the e disturbance as in Fig. 2.3), by Huygens’ 
principle we find it is deviated as shown.+ There is one direction in the crystal, 
the direction of the optic axis, such that both disturbances travel in the same 


two systems propagate independent! 


+The o and e rays were originally called the ordinary and extraordinary rays on 


account of this behaviour. 
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direction and with the same velocity. From the figure this is clearly the axis of 
revolution of the ellipsoid. 

In most crystals the ray surface is more complicated than as described 
above, and it is better to consider propagation of plane waves rather than 
light from a fictitious point source in the crystal. In general, corresponding to 
any unpolarized plane wave incident on the crystal from air, there are two 
plane-polarized waves propagated inside the crystal in different directions 
and with different speeds. Two beams plane-polarized at right-angles and 
parallel in direction emerge from the crystal, as in Fig. 4.5. The lateral 
displacement between the beams is used in some polarizing devices, but a 
more important effect is a relative retardation or optical path difference 
between the two emergent wavefronts. The retardation is used to produce 
elliptic or circular polarization, as in Fig. 4.6. Plane-polarized light, with its 
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Isotropic materials, e.g. glasses, become double refracting or birefringent, 
like crystals, when under mechanical stress or in static electric or magnetic 
fields. Materials such as stretched plastic sheet (almost any kind) are found to 
be birefringent on inspection between polarizers. These materials contain 
long polymer molecules which acquire a partial common alignment from the 
stretching, and thus they are similar to crystals. If a chromophore is added to 
the polymer it may absorb one type of polarization and transmit the other; 
this is the principle of the commonest kind of ‘Polaroid’, which is a poly(vinyl 
alcohol)-iodine complex. 

The effects of electric and magnetic fields in producing birefringence have 
applications in modern optics, e.g. in modulating the intensity in a light beam 
according to an electrical signal. There is a second group of effects in which a 
medium rotates the plane of polarization of incident plane-polarized light. 
This effect is intrinsic i.e., does not depend on electric or magnetic fields in 
certain solutions of molecules which form stereoisomers, i.e. the molecule and 
its mirror image cannot be superimposed. A simple example is lactic acid, 
(CH,)CH(OH).CO2H, in which the central carbon atom is bonded to four 
different groups—methyl, hydrogen, hydroxyl, and carboxyl—so that this 
structure cannot match its mirror image. Many organic compounds have this 
property, including some sugars, and an important method, saccharimetry, of 
estimating sugar concentration is based on it. Rotation of the plane of 
polarization can be induced in almost all materials by a magnetic field. For 
transparent materials this rotation is called the Faraday effect. This effect has 
been used, for example, for estimating magnetic fields in space and for 
measuring very large direct currents (by estimating the surrounding magnetic 
field). Magnetic rotation also occurs on reflection from metal surfaces, when 
it is called the Kerr effect. This effect is used in studying the microstructure of 
magnetic alloys with a polarizing microscope. 7 

Mathematical formulations and a detailed treatment of crystal optics and 
of electro- and magnetooptical effects are given elsewhere (e.g. Born and Wolf 


1965). 


4.4, Polarization and interference 


According to the discussion in Section 4.1 we must include the state of 
polarization in any precise discussion of coherence. If the interference 
experiment as in Fig. 3.1 is carried out with an unpolarized source, we find 
interference fringes as expected. However, if we polarize each interfering 
beam separately in orthogonal directions there is no interference, and we 
cannot produce interference by then rotating one of the planes of polarization 
to agree with the other by means of, for example, a half-wave plate in one 
beam (see Problem 4.6). We interpret this to mean that unpolarized light is to 
be regarded as the sum of two plane-polarized and mutually incoherent 
components which are orthogonal, i.e. their planes of polarization are at 
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FIG. 4.7. Orthogonal plane-polarized components. They are mutually incoherent if 
selected from an initially unpolarized beam. 
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4.4. Prove that at Brewster incidence the reflected and refracted rays are mutually 
perpendicular. 

4.5. Calcite has n, = 1.659,n, = 1.487. Calculate the thickness of a quarter-wave plate 
for wavelength 589 nm. 

4.6. Show that the effect of a half-wave plate with its axis at an angle 0 to the 
polarization direction of plane-polarized light is to rotate the plane of polarization 
through 20. (Resolve the incident field parallel and perpendicular to the plate axis.) 


5. Image-forming instruments 


Must get those old glasses of mine set right. Goerz lenses, six guineas. 
James Joyce: Ulysses 


5.1. Instrument design 


In the design of an optical system 
factors may be important: (1) light. 
bright image; (2) magnification; ( 
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5.2. Telescopes 


Figure 5.1 shows the essentials of a refracting astronomical telescope. An 
objective lens, of diameter D and focal length f}, 


objects at its focal plai 


forms a real image of stars 


and other astronomical ne. If the angle subtended 
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between two of the stars is £ then from Chapter 2 their images are separated 
by a distance Bf,. These real images are recorded by a photographic plate or 
other detecting system, or alternatively they may be viewed by the eyepiece. 
The eyepiece is a system of focal length f. It is used with the object, i.e. the Teal 
image of the star field, at its first focal plane. Thus it forms an image of the 
Stars at infinity, so that they are viewed by the relaxed eye (probably equipped 
with glasses in the case of an elderly astronomer). The pair of stars now 
subtends an angle Bf,/f, so that the angular magnification is f,/ f2. Thus for 
physical detectors the magnification is determined by a scale factor, 
according to which an angle f between objects at infinity corresponds to a 
distance Bf, in the image plane, whereas for visual observation we use the 
angular magnification f,/ f2- 

The light-gathering power depends on conflicting factors. Image-recording 
detectors, in the sense used in Section 1.4, all have a certain minimum size of 
image which they can ‘see’. Thus, suppose that by some means we produce an 
extremely small point image, say 250nm across, on a photographic 
emulsion.} The image patch recorded by the emulsion will actually be much 
larger, owing to scattering and diffusion of light in the emulsion and other 
effects. Such an image produced by a negligibly small light patch is called a 
point spread function, and its size can be used to characterize the limiting 
performance of the detector. The concept is applicable to all detectors. In the 
normal human eye the point spread function projected back into the outside 
world corresponds to an angular subtense of about 0.0003 rad or a distance of 
1 mm at 3 m; for a television camera it corresponds to the scan-line size; for 
photographic emulsions it can range from 500 nm (for special emulsions for 
holography or spectroscopy) to about 0.02 mm for very high-speed pan- 
chromatic emulsions. 

Returning now to the telescope, if the star image is smaller than the point 
spread function of the emulsion, the light-gathering power must be propor- 
tional to D?, since it is simply a question of the total of light flux collected. On 
the other hand, if the image is larger than the point spread function, as in the 
case of a planet or a nebula, the light-gathering power is measured by the flux 
per unit area falling on the emulsion, and then from Section 2.6 it is 
proportional to D?/f?. In considering light-gathering power for visual 
observation we have to ask whether the pupil of the eye admits all the light 
collected by the telescope. It can be seen by following rays through the system 
that the eyepiece forms an image of the aperture of the objective at a point to 
the right of the whole system where pencils from a star away from the axis 
cross the axis. In this context the aperture of the objective is the entrance pupil, 
and this image of it is the exit pupil. Clearly the eye must have its pupil roughly 
at the exit pupil of the telescope in order to see all the field of view. Thus the 
light-gathering power for visual purposes depends (1) on whether the exit 
+ This is about the smallest point image which can be produced in the optical region, as 


will be seen in Section 5.4. 
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pupil of the instrument is larger or smaller than that of the eye, and (2) on 
whether the star images are larger or smaller than the point spread function of 
the eye. This argument is taken further elsewhere (see e.g. Chapter 9 of 
Welford (1962)). The above discussion shows that the photometry of optical 
systems can be a complex topic. 

A classical problem in astronomy is the resolution of double stars or similar 
close objects. On the basis of geometrical optics alone there need be no limit 
to resolving power; we merely have to make a telescope with adequate 
magnification and light-gathering power and with perfectly corrected 
aberrations. However, according to physical optics, there is a limit. We regard 
the two stars to be resolved as point sources of equal brightness and not, of 
course, coherent with each other, since they are in fact separate thermal light 
sources. The telescope aperture, i.e. the rim of the objective, limits the size of 
plane wavefronts accepted from one of the stars and thus diffraction occurs at 
the aperture. The objective itself then brings the far-field diffraction pattern to 
a convenient place for viewing—the focal plane. In other words, the image of 
a point object according to physical optics is the far-field diffraction pattern of 
the aperture of the objective. We have already described this image in Section 
3.4 (eqn (3.16) and Fig. 3.13), and we know that it is a diffuse patch of light of 
which the angular size depends on the diameter ofthe diffracting aperture and 
on the wavelength of the light. From the present point of view we can call it 
the Point spread function of the objective. Obviously the resolution of the 
optical system, leaving aside the effect of the detector, is determined by the size 
of this Point spread function, since the image of the double star consists of two 
overlapping point spread functions. From the discussion in Section 3.4 the 
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FIG. 5.2. Two star images of equal intensity which are conventionally a just- 
resolvable distance apart. The angular separation of the stars is 1.22 2/D where Dis the 
pemicter of the telescope objective. The sum of the two images is indicated in broken 
ine. 


The above discussion shows how the factors listed in Section 5.1 can enter 
into any given problem. To summarize: (1) we must have enough light- 
gathering power to record the required event in a suitable time, e.g. up to 
several hours in astronomy, less than a nanosecond in the study of pulsed 
lasers, or anything in between; (2) there must be adequate magnification to 
ensure that the detector can separate all the detail in the image which is (3) 
resolved by the optical system. We must always make a clear distinction 
between (2) and (3). For a telescope the resolution depends only on the 
diameter of the objective and it is independent of the focal length. On the 
other hand, again in the case of the telescope, the scale of the star picture on 
the photographic plate depends on the focal length but not on the diameter of 
the objective. A similar distinction can be drawn for most optical instruments. 

Light-gathering power is the most important factor in modern astronomy. 
For technical reasons concerned with chromatic aberration and with the 
manufacture of optical glass all large telescopes have mirror objectives. 
Figures 5.3-5.5 show three widely used types. The Newtonian telescope 
(Fig. 5.3) is simply a large concave paraboloid of revolution. It is easily shown 
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Image plane 


FIG. 5.3. The Newtonian telescope. The photographic plate or other detector is 
placed at the image plane. (Newton’s instrument was, of course, used visually with a 
small plane mirror to permit viewing from the side of the telescope tube: modern 
telescopes with a paraboloidal primary mirror are always called Newtonian.) 
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FIG. 5.4. The Cassegrain telescope. The convex mirror, called the secondary, has a 
hyperboloidal shape if the primary is a paraboloid. The system has a relatively long 
focal ratio (F/8 to F/11) to match a Spectroscopic system attached to the telescope. 
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Parallel to the axis exactly to a point focus at the 
(geometrical) focus of the generating parabola; the detector, e.g. a photo- 
graphic plate or a photoelectric image-intensifier, is Placed at the image plane. 
The Newtonian is the system with greatest speed for direct image recording, 
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(Fig. 5.5) can have an angular field of a few degrees at about F/2.5 for 
apertures exceeding | m; it is normally used photographically (and therefore 
usually called the Schmidt camera) for rapid, large-scale surveys of the sky. 


5.3. The human eye 


We have to describe the eye both as an optical system and as a detector, since 
it is used together with other optical systems. Figure 5.6 is a very simplified 
diagram of the eye. Most of the refracting power is in the front surface of the 
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FIG. 5.6. The human eye. This is a very schematic and simplified diagram. Most of 
the lens effect is due to the sharply curved front surface of the eye, the cornea. The iris is 
the aperture stop of the eye. 


cornea, and the main function of the lens is to vary the focus so as to be able to 
see clearly over a range of distances. Thus the eye is like a camera, forming an 
image ofa distant scene on a light-sensitive surface, the retina. The retina is an 
array of light-sensitive cells which communicate with the brain via a complex 
network of interconnected nerve cells. The normal eye can adjust its focus or 
accommodate to form sharp images of objects at distances from infinity to 
about 250 mm.+ The accommodation is not done as in a camera, by changing 
the distance between the lens and the detector, but by varying the lens 
curvatures by muscular control. The iris varies in diameter between about 
2mm and 8 mm, depending on the average brightness of the scene being 
viewed. The sensitivity of the visual channel from the retina to the brain also 
varies (adaptation), so that the eye can be used over a very wide range of light 
intensities, more than 8 orders of magnitude for diffusely illuminated scenes. 
A very faint flash consisting of only a few photons of green light striking a 
single receptor in the retina can be detected under suitable conditions, and at 
the other end of the scale a flux of 0-1 mW on a single receptor can be 
tolerated for about 0.1 s, so that on this basis the range of sensitivity of the eye 
is about 14 orders of magnitude. This range is obtained, as in most forms of 


+ But the range of accommodation decreases in old age. Many people have a different 
range, and they have to wear glasses in order to add or subtract power to shift the range 
to the ‘normal’. Children can often accommodate to much closer than 250 mm. 


74 Image-forming instruments 


sensory perception, by an approximately logarithmic response, ie. a given 
difference in sensation corresponds to the same ratio between light signals at 
any part of the range. In many physical measurements a linear relation 
between the quantity measured and the indication is desired, but the great 
compression of range produced by a logarithmic response is often very useful. 

As noted in Section 5.2, the angular resolution of the eye is about 1 min of 
arc (0.0003 rad), depending on the conditions. This figure is used in 
determining the required magnification of an optical instrument. Thus if a 
telescope can resolve detail of 1 sec of arc we have to make the eyepiece 
magnify this detail enough to subtend 1 min to the eye (in practice 2 or 3 times 
more). The magnification is calculated as in Section 5.2. 

We must stress that the above description of the eye is very incomplete and 
lacking in detail. The responses of the eye to varying light levels, to fine detail, 
and to different wavelengths are very complicated, and are by no means fully 


understood. Our description is intended merely as a sketch to suggest how the 
eye is coupled to other optical systems. 


5.4. The microscope 


A microscope is essentially an elaborate magnifying glass. If we use a lens of 
focal length fto form an image at infinity of an object of size n (as in Fig. 5.7), 
the magnified object appears to subtend an angle n/ f. On the other hand, if we 
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A magnifying glass. The object is (approximately) at the front focus, so that 
seen by the relaxed eye at infinity and it appears to subtend an angle n/ f. In 
magnification is not very dependent on the exact Position of the object. 
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as possible to resolve small detail, since the size of the point spread function is 
proportional to f/a. In fact it is found (Welford 1962) that it is the sine of this 
angle which matters, and the accepted form of the resolution limit is 


Nin = 0-54/sin a. (5.2) 


The precise value of the numerical factor depends in a complicated way on the 
conditions of illumination of the object and on just how we define ‘resolution’, 
but the value 0.5 is adequate for most practical purposes. 

Since the diameter 2a must be less than, say, 4 mm in order to match the 
pupil of the eye, it can be seen that we have to use very short focal length lenses 
in order to get high resolving power, and focal lengths of 2 mm or less are used 
with angles « up to about 60°. The lenses have to have many components to 
keep the aberrations small, and it is then found to be impossible to get the eye 
close enough to the lens to see a reasonable field of view. Thus the so-called 
compound microscope was developed, as in Fig. 5.8. An objective with much 
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Object plane Primary 
image plane 
FIG. 5.8. Principle of the compound microscope. The objective has a very small 
focal length so that the primary image is highly magnified, and it has a large collecting 
angle « for high resolution. The primary image may be recorded directly on a physical 
detector, e.g. a photographic emulsion, or it may be viewed through an eyepiece. 


the same characteristics as we postulated for the magnifier in Fig. 5.7 is used 
to form an enlarged real image of the object. This real image can be recorded 
any other physical detector, just as for the 


photographically or by means of 

primary image in an astronomical telescope; or alternatively we can form a 
virtual image of it at infinity with an eyepiece and simply look at that, as in 
conventional microscopy. Whatever mode of detection is used we again note 
that the function of resolution depends on the objective and its collecting 
angle, but not on the details of the optical system which follows. Since the 
wavelength appears in eqn (5.2) in the numerator we can also gain resolution 
by going to shorter wavelengths. Little has been done with ultraviolet-light 
microscopy, but great gains have been made in electron microscopy. The de 
Broglie wavelength of electrons of 1MeV energy is about 107'?m 
(Radiation and quantum physics, OPS 3) and the collecting angle of electron 
lenses used as microscope objectives is of the order of 107°, so that the latest 
electron microscopes are approaching the resolution of intermolecular 
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distances. However, the ccllecting angle is limited to that small value by 
aberrations which seem to be in principle irreducible, whereas optical 
microscope objectives can be made practically aberration-free for values ofa 
up to 60°. 

In eqn (5.2) Ain the numerator is the wavelength in the medium containing 
the object. Thus if the object is embedded in a medium of refractive index n 


then 2 = A/n, where A is the vacuum wavelength of the light. Equation (5.2) 
then takes the form 


resolution limit nmin = 0.529/n sin æ. (5.2a) 


and we see that a gain in resolution is obtained by so embedding or immersing 
the object in a medium of high refractive index. This is the principle behind 
the oil-immersion microscope objective. The quantity n sin « is called the 


numerical aperture (NA), and it is quoted on microscope objectives as a 
measure of resolving power. 


5.5 Images of extended objects 


ds to justify taking the 
point spread function as the ideal Airy pattern (Fig. 3.13). This is not so with 
film or television cameras, and then 
berrations can be very different from 


a convergence angle of about 
f-width, and this would be 


Suggested in the figure. 
We can obtain an ex 
(3.11b), since in the p 


ane and fis the focal length. 
F(x, y), which defines the. 
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FIG. 5.9. A point spread function. The height of the solid at (č, n) is proportional to 
the light intensity in the point spread function. 


aberrations. Then if I(č, 7) is the light-intensity distribution in the point 
spread function we have, from eqn (3.11b) 
I 1 FF E(x, y) exp 12% (ex + ny) axal. 53 
=|— , y) exp 4—7 (2 yp dx dy} . B 
Em =la [f Fess exp yap Gx + mp dx dh (5.3) 


~ x 
We cannot evaluate this expression until we know the form of the pupil 
function F(x, y), and often the integration has to be done numerically, but we 
suppose this to be done. 

We next consider how the image of an extended object such as a bright disc 
or square is built up from individual point spread functions. We assume the 
object to be incoherently illuminated, i.e. the light from any one point of the 
object cannot interfere with that from any other point, so that we obtain the 
effect of overlapping point spread functions by adding their intensities. This 
will be so if the object is self-luminous, e.g. a hot filament or a gas discharge, or 
if it is illuminated by a non-monochromatic source of large enough size. 
Suppose then that the distribution of light intensity in the object is given by 
Olé, n). We use the same coordinates (č, 1) as in the image space to denote 
points which are object and image in Gaussian approximation, by simply 
rescaling the object coordinates according to the magnification of the optical 
system. An element of the object, say O(¢', n°) dé’ dy’, produces a point spread 
function which redistributes the light from the element over the image plane. 
Thus a point P(č, n) receives light from all the spread functions, and in 
particular from the spread function at (č', n’) it receives the contribution 


Me — č’, n — nO’, n’) de’ dn’, 


as in Fig. 5.10. The total effect at P is obtained by summing over all the points 
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FIG. 5.10. The image of an extended object. The point spread function centred at 
P'(č', n’) contributes some light intensity at P (¢, 7). 


(č', n’) of the object; then provided the spread function has the same form at 
all these points we have 


o'l, n) = fre = En = nO’, n’) ae’ dy’, (5.4) 


and this is the light-intensity distribution in the image O'(č, 7) of the object 
O(č, n). Thus we see that the image is obtained as the convolution of the 
object and the point spread function (see Appendix). Convolution is the 
mathematical representation of a physical process in which a sharply defined 
input is spread to produce a blurred output. For example, in a communi- 
cation channel the output signal is the convolution of the input signal and the 
impulse response. This latter is the response of the channel to a delta-function 
input signal, and so it corresponds to the point spread function in an optical 
system. 

We can formally write the object and image intensity distributions O(E,n) 
and O'(č, n) and the point spread function 1(€,) as the inverse Fourier 
transforms of certain functions o(s, t), o'(s, t), and L(s, t), 


Olé, n) = fois; t) exp fi2n(sé + tn)} ds dt, (5.5) 
OE) = ffos, t) exp {i2n(sč + tn)} ds dt, (5.6) 
In) = frs, t) exp {i2n(së + tn)} ds dt, (5.7) 


and the convolution theorem (Appendix) tells us that 


o'(s, t) = o(s, t)L(s, t). 
The physical significance of e 
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the x- and y-directions has the amplitude o(s, t) ds dt. Then according to eqn 
(5.8) the amplitude of the periodic component in the image having the same 
pair of spatial frequencies is obtained by multiplying the amplitude of the 
object component by the factor L(s, t), the Fourier transform of the point 
spread function (eqn (5.7)). This function L(s, t) is called the optical transfer 
function, usually abbreviated to OTF, and its role in an optical system is 
analogous to that of the transfer function of an electrical channel such as an 
amplifier. The form of the OTF, i.e. its numerical values, depends on the form 
of the point spread function, but it can be shown that whatever the form of the 
OTF a single-frequency object or sinusoidal grating forms a similar (i.e. also 
sinusoidal) image, but with different contrast. The reduction in contrast as a 
function of spatial frequency is used as a measure of quality for optical 
systems which are to be used for imaging extended objects in incoherent 
illumination. Figure 5.11 shows how images of sinusoidal objects of different 
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FIG. 5.11. Contrast transfer for sinusoidal objects. (a) 0 
duction in contrast of the 


high spatial frequencies respectively, showing greater re 
image, i.e. smaller relative swing in intensity, for (b). 


med. The contrast is usually less for higher spatial 
frequencies and there is no image contrast at all, i.e. no modulation, for spatial 
frequencies above a certain limit corresponding roughly to a grating with 
lines spaced at the resolution limit (eqn (5.2)). 


spatial frequencies are for: 


Problems 


5.1. What is the theoretical angular resolution of telescopes with objective diameters of 


100 mm, 1 m, and 5 m? 

5.2. A telescope with a 50 mm aper 
stellar photography. Estimate t 
plate scale in radians per millimetre. 


ture and 500 mm focal length objective is used for 
he size of a star image on the plate and calculate the 
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5.4. 


SS) 


5.6. 
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The telescope in Problem 5.2 is to be used visually. What focal length would be 
required for the eyepiece in order to take advantage of the theoretical resolution of 
the objective? 

Plot a graph of the light intensity across the centre line of two star images of equal 


intensities when the centres are separated by a distance equal to the radius of the 
first dark ring of the Airy pattern. 


What is the resolution limit of a microscope objective of numerical aperture 0.65, 
and what overall magnification of the microscope is needed to take full advantage 
of this resolution? 


Suggest a microscope NA and a ma; 


blood cells 7 um in diameter 
photographic emulsion. 


gnification suitable for visual study of (a) red 
and (b) grains of silver halide 1 um in size in a 


6. Interferometers and spectroscopes 


Pan H are longest. Roygbiv Vance taught us: red, orange, yellow, green, blue, indigo, 
violet. 
James Joyce: Ulysses 


6.1. Young’s experiment; spatial coherence 


Figure 6.1 shows a simple way to produce interference effects. There are two 
pinholes in a screen placed in a collimated beam; the light from each pinhole 
spreads by diffraction into a cone of semiangle approximately À/d, where d is 
the pinhole diameter, (see Section 3.4). Thus the pinholes are secondary 
sources producing diverging spherical wavefronts; these interfere where they 
overlap, since they have come from the same original source, and they 
produce interference fringes as in the figure. A simplified version of this 
experiment was carried out by Thomas Young in 1804. The experiment led to 
the general acceptance of the theory that light is a wave phenomenon. 
Young’s experiment can be used to illustrate the concept of coherence 
between light beams. If the source in Fig. 6.1 is a helium-neon laser it is not 
essential to have a pinhole at the focus of the collimator and the fringes will 
have good contrast or visibility if the two secondary pinholes are equal in 
diameter. If we use a thermal source such as a sodium lamp we find 
experimentally that the collimator pinhole must be restricted in size for 
fringes of good contrast to be formed. We can see why this is by an argument 
similar to that in Section 3.1. Let the source pinhole have diameter d, let the 


Fringes 


d 
LEM i 


TE, = È 
Young's interference experiment. The collimator is 


FIG. 6.1. A version of Thomas : : : 
relationships between the two pinholes 


not essential. It is put in to make the phase 
clearer. 
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collimator focal length be f, and let the two secondary pinholes bea distance a 
apart. The spacing of the fringes formed by light from a given point in the 
source is AL/a, where 4 is the wavelength and L is the distance from the 
pinholes at which the fringes are observed. The phase difference at the screen 
between disturbances from source points on either side of the pinhole is 
(2n/A)ad/f, so that the different source points form fringe systems displaced 
laterally by the fraction of a fringe ad//f, as in Fig. 6.2. Thus if ad/fis of order 


XN AAS 
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FIG.6.2.  Superposed fringes produced in Young's experiment by 
finite size. 


asource pinhole of 


of magnitude unity or greater there will be more or less uniform illumination, 
i.e. no fringes will be seen. In other words, we must have 

ad/f < 2/4 (6.1) 
for fringes of good contrast to be formed. 


This equation has more than one interpretation. It tells us how small the 
source pinhole must be in order to get good fringe contrast: it must be smaller 
than 2f/4a, and it is then said to be a ‘diffraction-limited pinhole’. Equation 

Ween points on the screen within which the 


h - We see from eqn (6.1) that as the source gets 
bigger the coherence patch gets smaller. 
We can use eqn (6.1) in yet 
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FIG.6.3.  Michelson’s stellar interferometer. Beams a distance a apart on the ground 
but coming from the same star are made to interfere. The details of the interferometer 
do not matter in principle. The contrast of the interference fringes depends on the 
angular diameter of the star. 


when ais approximately 7/B, or, more precisely for astar of circular shape and 
uniform brightness, 1.224/2. In Michelson’s instrument the mirror separation 
acould take values up to about 6 m. The details of the arrangement by which 
the interference fringes are produced are in principle irrelevant to the 
measurement: we merely have to make the beams interfere. The same 
principle is applied in measuring the diameter of radio stars, but the 
‘interference’ is arranged by mixing the radio signals collected by antennae at 
suitably variable spacings. The wavelengths are in the centimetre range, and 
the separations are of the order of a kilometre. 


6.2. Michelson’s interferometer; temporal coherence 

d briefly interference between beams reflected at the 
two surfaces of an oil film on water or a similar thin layer. The detailed theory 
of these effects is complicated by multiple reflections to and fro in the film. The 
principles can be seen more clearly in an apparently more complicated 
apparatus, Michelson’s interferometer (not to be confused with his stellar 
interferometer), shown in outline in Fig. 6.4. In principle, this is merely a 
device for studying interference between coherent beams reflected from two 
parallel or nearly parallel surfaces, but in order to avoid multiple reflections 
the surfaces are not placed almost in contact, as in the oil film. The surfaces 
are the two plane mirrors My and M3. The beam-splitter, or semi-reflecting 
and semi-transmitting mirror, is adjusted to make the image of M, appear 


In Section 3.1 we discusse 
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Beam-splitter 


FIG. 6.4. Michelson’s interferometer (not to be confused with the stellar inter- 
ferometer of Fig. 6.3), M; is the apparent position of M, as seen in the beam-splitter. 


parallel to M, and at a distance z from it, as seen by a detector (e.g. the eye) 
looking into the system as shown. We consider interference between 
collimated beams from a source at infinity having a finite angular subtense; 
this could be arranged as in Fig. 6.5, where the source, e.g. a mercury lamp, is 
at the focus of a collimator of focal length f, and has a diameter 2a. The 
interference effects are to be observed at the far-field, since this is where the 
individual source points are imaged, and for this we use an objective of focal 
length f, and observe at its focal plane. 


A point of the source at ad 


istance p from the axis of the collimator 
prod 


uces a collimated beam inclined at an angle p/f, to the axis of the 
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ferometer. 


FIG. 6.5.. Formation of circular fringes in Michelson’s inter: 
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interferometer. This beam therefore meets an effective plate of air of thickness 
z at an angle of incidence p/f,, and from eqn (3.4) the optical path difference 
between the two beams reflected back from M, and the image of M, is 


W = 2z cos (p/f,). (6.2) 


At the centre of the far-field pattern this path difference has the value 2z, 
and if this is an integral number of wavelengths the centre will be bright.+ 
Going out from the centre, W decreases because of the cosine factor in eqn 
(6.2), and each time it decreases by one wavelength we reach another bright 
fringe. The fringes must be circular, since W depends only on the angle of 
incidence p/f,, not on the azimuthal angle. It is easy to show that the radius of 
the N" fringe is proportional to N™?, as for Newton’s rings. 

Consider only the centre of the fringe system 0’, and suppose the distance z 
to be varied steadily. Then, as the path difference W = 2z varies, 0’ will be 
alternately bright and dark, and the intensity there will be proportional to 


1 + cos (4nzv/c), (6.3) 


where v is the frequency of the light. A small enough detector at 0’ would 
record this as a fringe pattern or fringe function in the variable z. 

So far we have assumed the light to be monochromatic with frequency v. If 
this is not so (e.g. we might be using a source with a broad spectrum line, such 
as a high-pressure mercury lamp, or perhaps a continuous source, such as a 
filament lamp) we suppose that the proportion of power in the light beam 
between frequencies v and v + dv is G(v) dy. Thus G(v) is proportional to the 
light intensity seen through a prism or other spectroscopic system. The fringe 


function for the frequency band dv is then 

G(v){1 + cos (4nzv/c)} dv 
and the total fringe function for the light of all frequencies added together is 
obtained by integrating with respect to v, 


gz) = few {1 + cos (4nzv/c)} dv. (6.4) 


The physical meaning of this equation is that we are adding together 
individual fringe functions of different fringe spacings. The fringe spacing in 
the z domain for frequency vis ¢/2v. Thus these fringe systems start in phase at 
zero path difference (z = 0) and they gradually get out of phase as z increases, 
so that the contrast of the fringes falls. For nearly monochromatic light, i.e. a 
small range of frequencies, the contrast is good over a large path difference, 
and for ‘white’ light only a few fringes can be detected with measurable 
contrast. Thus the interferometer can be used to estimate the narrowness of a 
spectrum line. Michelson himself used it in this way in 1892 to show that the 


+ Here, as elsewhere, we ignore complications due to phase-change effects on reflection 


(see Section 3.1). 
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red cadmium line of wavelength 643.8 nm is very narrow and is therefore 
suitable for standardizing the metre in terms of wavelengths. 

Returning to eqn (6.4) we see that the right-hand side is the sum of a 
constant (the integral over the spectrum of G(v)) and the cosine Fourier 
transform of G(v). Thus if the fringe function g(z) is recorded we can obtain 
the spectrum of the light by calculating the Fourier transform. This is the 
principle of Fourier-transform spectroscopy, wich has become a standard 
technique in many fields within the last two decades. Michelson actually 
determined several spectrum line profiles in this way over 80 years ago. 

The fall in contrast of the fringe function with increase of path difference 2z 
can be regarded also as decreasing coherence between the beams returning 
from the mirrors. Thus the path difference at which the contrast falls to some 
chosen value is a measure of the coherence length (Section 1.5) and the 
corresponding coherence time is 2z/c. These are therefore measures of the 
length of wave-train which is reasonably correlated with itself or which is 
approximately sinusoidal with the same frequency and amplitude along its 
length. Thus coherence length and spectral composition are two different 
aspects of the same physical Phenomenon. 

The Michelson interferometer of Figs. 6.4 and 6.5 h 
applications (see e.g. Born and Wolf 1965), but that which w 
has the greatest importance in basic physics. 


as many other 
e have described 


6.3. Prisms and gratings as dispersing elements 
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slit spectrum 
FIG. 6.6. Principle of the prism spectrograph. 
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consists of a slit at the focus of the collimator to define the direction of the 
incident beam, the dispersing prism, the camera objective to focus the 
dispersed beams, and the photographic plate or other image-recording 
detector. Figure 6.6 shows only the principle of the simplest kind of 
spectrograph; there are many variations for different purposes. 

The mode of action of a diffraction grating is not quite so obvious as that 
of a prism. Figure 6.7 shows a grating consisting of narrow slits in an opaque 


Incident 
wavefront / 


ZN 
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FIG. 6.7. First-order diffracted wavefronts as the envelopes of secondary waves 
from the slits or ‘rulings’ of a diffraction grating. 


screen with spacing o between the slit centres. If a collimated beam of light is 


incident normally from the left each slit diffracts light as in Chapter 3, and, if 
the slits are narrow enough, the diffracted light spreads out over a range of 


angles, as indicated. The diffracted beams from each slit interefere, since they 


originated from the same collimated beam. In the far-field all diffracted beams 
iginal incident beam, but there can be 


will be in phase in the direction of the ori a t c 
other directions in the far-field in which beams from neighbouring slits are 
one wavelength out of phase with each other. Such a diffracted beam is 
indicated in the figure. EA . 

Figure 6.8 shows how we can calculate the directions of this and other 
diffracted beams: the ray and wavefronts are indicated in inverted commas 
because they do not exist in the near field, but the construction gives the 
direction of the diffracted ray and wavefronts. For the direction &«' we must 


have 
sin «' = A/o. (6.5) 


There may also be other diffracted beams with two, three, or more 
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FIG.6.8. Calculation of the angle of diffraction. The ‘tay’ is the common normal to 


the ‘wavefronts’, and these are the envelopes of the actual diffracted wavefronts, as in 
Fig. 6.7. 


T Ray” 


wavelengths path difference between the waves from successive slits, and for 
these we should have 


sin a’ = M2/øo (M an integer). (6.6) 


Since in eqns (6.5) and (6.6) the angle of diffraction depends on the 
wavelength, a collimated beam of white light incident on the grating will be 
spread into a spectrum. In fact there will be several spectra corresponding to 
the different orders M, and there will be an undispersed zero order, the light 
which travels on undeviated. This is indicated in Fig. 6.9. 


Incident beam 


Grating 0 PE o 


FIG. 69. Spectra of different orders formed by a grating: A, is greater than Ap. 
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Diffracted ray 


Incident ray 


Normal 


Grating 


FIG. 6.10. A plane reflection grating. 


rather than in transmission, as in Fig. 6.10. By an extension of the argument 
used above we can show that for angles of incidence and diffraction « and o 
the relation corresponding to eqn (6.6) is 


sina + sina’ = Mi/o. (6.7) 


This equation gives the direction a’ in which the Mth-order diffracted beam of 
wavelength 2 goes for an angle of incidence 4. Obviously it reduces to the 
ordinary law of reflection for the zero-order beam. 

Many grating spectroscopes use reflecting collimators and objectives 
rather than lenses because of the better aberration correction (no chromatic 
aberration) and greater wavelength range which can be used with mirrors. 
Figure 6.11 shows one simple design of monochromator, i.e. a system with a 


Collimator, 
and objective 
mirror 
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rotated about an axis normal 


FIG. 6.11. A grating monochromator. The grating is abou 
ss the exit slit. 


to the plane of the diagram to scan the spectrum acro! 


fferent wavelengths are scanned 
ed. If the grating is formed on a 
ties of the concave mirror are 


second slit at the plane of the spectrum. Di 
across the exit slit when the grating is rotat 
concave surface the image-forming proper 4 
combined with the dispersion of the grating, and the spectrum is formed and 
focused by a single element, the concave grating, as in Fig. 6.12. Ifa complete 
spectrum is recorded photographically, as in the figure, the instrument 


becomes a grating spectrograph. 
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Grating D 


FIG. 6.12. The concave diffraction grating arranged as a spectrograph. 


Spectrum 


6.4. Dispersion, resolution, and light-gathering power of prisms and 
gratings 


The light-gathering power of spectroscopic systems depends on the area of 
the entrance slit and on the angular subtense of the collimator aperture in 
much the same way as for ordinary image-forming systems, but it also 
depends on the dispersion, i.e. the angular separation in the far-field per unit 
wavelength or frequency interval. The light-gathering power also depends on 
the nature of the detector, i.e. whether this is an image-recording system, such 
as a photographic emulsion in a spectrograph, or a total flux collector, such as 
a photomultiplier at the exit slit of a monochromator. In the simplest case, a 
monochromator with entrance and exit slits of equal width and with a 
dispersing element of area A, the flux transmitted in the wavelength interval 
62 is proportional to BdxA6A, where dz is the angular subtense of either slit 


along the direction of dispersion and is the angular subtense of the height of 
either slit. This can be written 


da 
Boar A a ba, (6.8) 


where d//de’ refers to the dis; 
for light-gathering power 


angular dispersion, ie. where the different wavelengths in the light to be 
analysed are sent in differen 


grating to be used. 
We calculate the an 


gular dispersion ofa grating by differentiating eqn (6.7) 
with respect to a’, 


di o 5 
w mA (6.9) 
Thus for small angles of diffraction the dispersion ofa grating is almost linear, 


since cos 2’ ~ 1 for small a’, ie. the wavelength found in the spectrum is 
directly proportional to the angle of diffraction. 


Figure 6.13 shows the notation used for deriving the formula for the 
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FIG. 6.13. Notation for calculating the dispersion of a prism. 


dispersion of a prism. By differentiating Snell’s law for both faces and 
eliminating the internal angles of incidence it can be shown that 

da} sin A 

dn cos 2’ cos æ,’ 


or 


da’, sin A dn (6.10) 

då cos a, cosa, di’ 3 
where dn/då4 is the dispersion of the prism material. It can also be shown that 
in the symmetrical position, with 2, = %, the total angular deviation of the 
beam is a minimum, and for this minimum deviation position we have 


da, _ 2sin $A dn (6.11) 
då cosa, då 


For both the diffraction grating and the prism the final image of a point in the 
entrance slit is formed as the far-field diffraction pattern of an aperture which 
may be either the rectangular outline of the prism or grating itself or the 
aperture of a lens or mirror used to bring the far-field pattern to a focus. Thus 
an indefinitely narrow entrance slit illuminated with perfectly monochro- 
matic light would still produce a spectrum line of a certain finite angular 
subtense. This, by the same reasoning as we used in Section 5.2, must be of 


order of magnitude 4/D, where D is the width of the prism or grating aperture 


across the direction of dispersion. The criterion for the just-resolvable 
Separation of two wavelengths 4 and 4 + 62 therefore that their directions 
shall be separated by this angle, as in Fig. 6.14. The resolving power of a 
dispersing element is conventionally defined as the number 


ìjà, (6.12) 
and we can find this for a grating by putting dz’ = 2/D in eqn (6.9). We obtain 
ies MD 
SA acosa’ 


but, recalling that D is measured across the width of the beam diffracted to the 
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FIG. 6.14. The concept of spectroscopic resolving power. 


far-field as in Fig. 6.15, we see that D/o cos «' isthe number N of rulings on the 
grating. Thus we have 


2/52 = MN, (6.13) 


or, the resolving power of a diffraction grating is the product of the number of 
rulings and the order of diffraction. 


D, 
t 


FIG. 6.15. Resolving power of a plane grating. 


Modern spectroscopic gratings for the visible 


4 region of the spectrum can be 
50-300 mm wide and can have 300-1000 ruli 


: ngs per millimetre (but these are 
© Mat a resolving power of order 3 x 105 is theoretically 


such a grating. 


We can obtain the resolving power of a prism by similar reasoning to that 
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used above, but it is easier to proceed from first principles. Figure 6.16 shows 
a collimated beam of wavelength À traversing a prism, with a plane wavefront 
Z of the transmitted beam. If the wavelength is decreased by 6/ the refractive 
index will increase by a corresponding amount dn on account of the 
dispersion of the glass; the optical path length through the base of the prism 
will increase by ôn- d, where d is the length of the path through the base, and 


FIG. 6.16. Resolving power of a prism. 


the wavefront in the new wavelength will turn through a certain angle to a 
Position X’ as in the figure. In order to make this angle equal to 4/D, so that it 
corresponds to the resolution limit, we have to make the change in optical 
path equal to one wavelength, i.e. we put ôn* d = À. Thus we find 


éned 
D 
or, 
Ae ee (6.14) 
oA dż 


he spectroscopic resolving power of a 
as the difference between the extreme 
uld not be used. 


This is the very simple formula for t 
prism. Strictly d must be interpreted F 
light paths across the beam, since the edge of the prism wo! 


6.5. Multiple-beam interference 


In Sections 6.1 and 6.2 we showed how the interference effects between two 


beams, giving cos? fringes, are used. The diffraction grating (Section 6.3) 
works through the interference of many beams, since it 1s only when the 
phases of all these coincide (as in Fig. 6.7) that a maximum of light intensity is 
found. In this context the grating is a multiple-beam interferometer. If we 
illuminate a grating with collimated monochromatic light we find maxima in 
the far field in directions a’, given by eqn (6.7), and we can plot these as in Fig. 
6.17 with vertical broken lines corresponding to the order M. The angular 
width of each maximum is, as in Section 6.4, 2/D, where D is the full width of 
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disin 2’) A(sin 2’) 


sin x 


FIG. 6.17. Diffracted monochromatic |i 


ight from a grating as an example of 
multiple-beam interference. 


the diffracted beam, and we have D = No cos #'. Thus the angular width is 
dx’ = 7/No cos a’, 


and the increment in sin g’ between successive orders is A(sin x’) = 4/o from 
eqn (6.7). Thus 


d(sin x’) 1 


=— (6.15) 
A(sin «’) N 


tailed calculation, see, for example, Born and 
Wolf (1965). 


Other forms of multiple-beam interference t 
reflecting surfaces. In Sect: 
films, but considered as 


x ake place between parallel 
lon 3.1 we discussed interference effects in thin 


an approximation only two beams—those first 
reflected from the upper and lower surfaces. These two beams would be 
beams 1 and 2 in Fig. 6.18, but actually the light is multiply reflected as 
indicated by the broken lines, and strictly all the beams should be taken into 
account in calculating the interference effects. 

In the case of the oil film or, for exa 
magnesium fluoride on glass, the effect: 
negligible, since they are much fainter th 
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FIG. 6.18. Multiple reflections in a thin layer. 


R 


<a 
coe 6.19. Principle of the Fabry-Perot interferometer. The films, in practice 
A Pported on plates of fused silica, have high reflectivity and low transmission. The 
i ngle ofincidence is exaggerated to separate the rays, although in fact all the successive 
ransmitted wavefronts overlap almost completely. 


Perot interferometer. This consists 
faces with high reflectivity and low 
g between the layers is d and 


Other cases. Figure 6.19 shows the Fabry 
of two accurately plane and parallel sur! 
transmission (R and T respectively). The spacin 
collimated monochromatic light from a broad source falls on the first 
surface.+ We suppose the incident beam to be broad enough and the angle of 
incidence to be small enough to ensure that many multiply reflected beams 
emerge, superpose and interfere in the far-field on the right-hand side. The 
interference pattern in the far-field can be brought to a focus by means of a 
lens and, as for the Michelson interferometer, it must, by symmetry, consist of 
concentric bright rings, each corresponding to a certain angle of incidence 0 
at which all the beams of a certain wavelength are in phase. 
To find the form and spacing of the fringes we note that, as in Section 3.1, 
the optical path difference between successive transmitted beams is 
=2dcos 0. Thus we can write down the complex amplitudes of the 
+ The reflecting surfaces could be very thin silver layers supported on glass or fused 
not silver, like the end mirrors of lasers, 


Silica, They are usually dielectric multilayers, 
and typically we could have R ~ 0.95 and T ~ 0.05. 
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successive transmitted beams as follows, taking the origin of phase at the 
point of emergence of the first beam, 

T, 

TR exp (i2zp/A), 

TR? exp (i47p/2), 

TR? exp (i6xp/A), 


Note that T and R refer to intensities but our present calculation is concerned 
with complex amplitudes, so that T gives the amplitude transmission through 
two surfaces, as required, and similarly for R. The total transmitted amplitude 
is the sum of terms like these. To simplify the calculation we assume the 
number of terms is infinite, and we then merely have to sum a geometric series 
with common ratio R exp (i2zp/A). The result is 


T 
1 — R exp (i27p/2) ` 


To get the transmitted light intensity we take the squared modulus, according 
to the rule in Section 1.3, giving 


Transmitted complex amplitude = (6.16) 


T E a 
1 + R? — 2R cos (22p/2) (1 — R)? + AR sin? (np/A) 


Finally, if we putR+T 
we have 


= 1, i.e. we neglect absorption in the reflecting layers, 


Fabry-Perot transmission (0) = 


4 
iff + T Sy sin? (= acos o)}. (6.17) 


It is easiest to see the general form of the multiple-beam fringes by regarding 
1(0) as a function of the Phase difference = (2n/2)d cos 0 which appears in 


uantity 4R/(1 — R)? is of order 


ent Way of using the 


is to detect only the central fringe, as we 


Interferometers and spectroscopes 97 


z 


M=999 1000 1001 


1) TR-S 


FIG. 6.20. The fringe shape for the Fabry-Perot interferometer as a function of 
= (27/2) d cos 0, half the phase difference between the maxima. The orders of 
interference indicated are notional. 


interferometer, and to change the wavelength by varying the spacing d. It is 
then called a scanning interferometer. In this mode the maxima for wavelength 
4 must occur at spacings d = M2/2, where M is an integer, the order of 
interference (but in the Fabry-Perot the order may be between 103 and 10°, 
whereas in the diffraction grating it rarely exceeds 10). The orders are 
numbered typically in Fig. 6.20. Let the half-width of the fringes be given by 2e 
as a fraction of the order—i.e. we suppose that, if we put d=(M + £)4/2 in 
eqn (6.17), the transmission falls to 0.5. Substituting in eqn (6.17) we have 


1 aR o : 
z= ifi +7 ZR sin? (Mx + enh 


sine and putting sin ex ~ em, We find 


Removing Mz from the argument of the 


2 1- R 
aa aR} ` 
This result is usually expressed in the form, fringe-spacing divided by fringe- 
width is 
nR"? (6.18) 
F= R s 


This quantity F is called the finesse.t We can use it to get an estimate of the 
resolving power of the Fabry-Perot interferometer. This is defined as for all 
spectroscopic devices as 7/54, where now 64 is the measure In wavelength 
units of the quantity 2e we found above. In the equation d= Mi/2 we now 
keep d constant and find the increment 54 corresponding to an increment 2¢ 


in M. We have ; g 
q- M42090- 


2 


+ Equation (6.15) gave the corresponding quantity 
its reciprocal. 


for the diffraction grating, or rather 
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or, to the first order in small quantities, 
2/8} = M/2e. 
Substituting 1/F for 2e we find 
4/62 = MF, (6.19) 


or, the resolving power of the Fabry-Perot is the product of the order of 
interference and the finesse. By comparing eqn (6.19) with eqn (6.13) it can be 
seen that the finesse plays a role in the theory of the Fabry-Perot similar to 
the number N of interfering beams in a diffraction grating. Thus F is 
sometimes called the effective number of interfering beams. 

In the Fabry-Perot a wavelength /, may give a fringe maximum of order M 
at the spacing d, while at the same spacing another wavelength 4 may also 
have a maximum, but of order M + 1. Then we have 


MA, = (M + 1), = 2d. 


The interval between these two wavelengths is the maximum length of 
spectrum which can be studied without confusion between spectra of different 


orders, and it is called the free spectral range. Denoting this quantity by AÅ we 
have from the above equation 


M(4 + A2) = (M + 1)}, 


from which, if M is much greater than unity, we obtain the two useful 
expressions for the free spectral range 


^l = ai EF (6.20) 


There is a similar effect of overlapping orders with the diffraction grating. 
Methods of avoiding confusion between overlapping orders are described in 
books on spectroscopic techniques. 


6.6. Thin film interference devices 


We mentioned antireflection coatings in Section 6.5 and we now discuss these 
and other related devices in more detail. 

Let a surface of glass of refractive index n be coated with a layer of material 
of refractive index no, less than n, and of thickness d such that Nod = 4,,/4 
where /,, is a chosen wavelength, usually in the middle of the visible spectrum. 
Small fractions of incident light will be reflected from both surfaces, as in Fig. 
6.21(a) and, since the phase changes on reflection will be the same for each 
fraction (Section 3.1) these fractions will be out of phase with each other by 7 
and they will therefore interfere destructively. Iffurthermore nå = nit is found 
from eqn (4.4) or (4.5) (by putting 0, = 0, = 0) that the intensities of the two 
reflected fractions are equal and the resultant reflected intensity will be zero. 
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An 


sia - 


FIG. 6.21. An antireflection coating; the rays are intended to suggest plane 
wavefronts incident normally. 


This is the principle of the simplest kind of antireflection coating. The 
condition nod = 4,/4 only holds at the one wavelength, Am, SO that at 
neighbouring wavelengths there will be some residual reflection and the 
graph of reflectivity as a function of wavelength will be as in Fig. 6.21(b). We 
may also note that the condition of z phase difference only holds at normal 
incidence for À»; at oblique incidence eqn (3.4) gives the optical path 
difference between the interfering beams and eqns (4.4) and (4.5) would be 
used to calculate the effects separately in p ands polarized light. i 

The above is a simplified sketch of the theory of such coatings which 
ignores multiple reflections. To calculate the properties more accurately and 
to deal with more complicated coatings made up of several films it is 
necessary to use electromagnetic theory. However, it is still possible with our 
method to see, for example, the principle of multilayer high reflecting 
coatings. These are made up of several (11 or 13) layers of alternating high 
and low refractive index, each of optical thickness nd equal to Žm/4, as in Fig. 
6.22. Then if we apply similar reasoning to the above, recalling rhe Se an 
phase changes on reflection (Section 3.1), we find that the beams reflecte 
from successive interfaces are successively 27 or zero behind each other in 
Phase, i.e. they are all in phase and soa very high reflected intensity bailas up. 
The end reflectors in lasers are made on this principle. Their behaviour at 
other wavelengths than Am and at non-normal incidence is complex and it 
cannot be explained by our elementary approach. 


6.7. Spectroscopy in general 
In the preceding sections we have described only a few of the large eee of 
spectroscopic methods and instruments based on apparently many di erent 
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cmemerrries 


hh A substrate 


FIG. 6.22. A multilayer coating giving high reflectivity at the design wavelength Ay. 


The layers are of alternately high (H) and low (L) refractive index and the optical 
thickness of each is Ay/4. 


principles, but in fact there are only a few underlying ideas, and it is mainly the 
technical details concerned with adaptation for special purposes which differ. 
Spectroscopy is concerned with measuring the proportions of different 
frequencies or wavelengths in a beam of polychromatic light. If we form a 
two-beam interference fringe system, e.g. with Young’s apparatus (as in Fig. 
6.1), the fringe spacing is proportional to the wavelength, and thus the 
intensity distribution in the fringe system formed by polychromatic light 
contains in an indirect or coded form the information we seek. The same is 
true for the Michelson interferometer—the fringe function is an encoded form 
of the spectrum. In both cases the decoding process is the same—taking the 
Fourier transform of the fringe function—but the Michelson interferometer 
gathers more light flux, and it is therefore preferable for most purposes. 
The Michelson interferometer can also be regarded as a multiplexer. In 
telecommunications it is common practice to use a single line or channel to 
carry several messages simultaneously, e.g. by using the messages to modulate 
different carrier waves transmitted at the same time. Now if the mirror in the 
Michelson interferometer moves with velocity v the fringe function for 
monochromatic light oscillates at frequency 2v//, as can be seen by putting 
z = vt in eqn (6.3). Thus a wavelength 2 is modulated at frequency 2v/A, and 
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cere nner is a multiplexing device which modulates each wavelength 
ifferent frequency. The multiplexed signal is then decoded by taking i 
Fourier transform. $ Sai 
m e ideas from communication theory have led to the development in the 
Pi years of several new spectroscopic devices intended to increase light- 
E ering power or speed of operation. For example, we can have a grating 
A onochromator with arrays of randomly spaced apertures in place of the 
ormal entrance and exit slits, as in Fig. 6.23. Such a system would apparently 


Direction of dispersion 


—_ 
Range of 
oscillation 
eae i Multiple slits, randomly spaced, of a modulation spectrometer. The exit- 
cule th y is similar. Wavelengths throughout the whole spectrum are transmitted, but 
y the wavelength for which the prism is set is strongly modulated. 


nding to the greater wavelength range 
ting is oscillated at a certain 
lated at this frequency but 


have reduced resolving power correspo! 
covered by the apertures, but if the prism or gra 
frequency the central wavelength is strongly modu 
neighbouring wavelengths are not. 
Systems like the Michelson inter! 
always produce an output which has to 


this is because the free spectral range ©} 
order to produce a spectrum direct rather than encoded we have to use 


multiple-beam interference, e.g. a diffraction grating or a Fabry-Perot 
interferometer. In these and many other kinds of multiple-beam spectros- 
copes the bright fringes are narrow enough to allow fringes from many 
neighbouring wavelengths to be formed in between them. 

The dispersing prism is a special case outside these classes. All the other 
systems we have mentioned rely on geometry alone for their effects— 
diffraction at slits followed by interference, OF reflection at mirrors followed 
by interference. The prism depends on a property of a material medium, 
namely dispersion, whereas the other systems could operate in a vacuum with 
thin films of conducting, i.e. reflecting material as mirrors, screens, and beam- 
splitters. 

In light scattered from a laser beam 
turbulent gas stream there are fluctu 
explained in Chapter 1, interpreted as a SP 
spectrum of the scattered light. Let the intensity 


ferometer based on two-beam fringes 
be decoded to give the spectrum, and 
f two-beam fringes is zero. Thus in 


by, say, a colloidal suspension or a 
ations in intensity which are, as 
ead of wavelengths in the 
in the beam as a function of 
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time be I(t); we can define the normalized autocorrelation function of the 
intensity as 


ca f I(t)I(t + 1) dt 
=f 


(see Appendix), where T is a time which is long compared to the fluctuations 
in question. From the autocorrelation theorem (Appendix) C (t) is propor- 
tional to the Fourier transform of {G(v)}?, the square of the spectrum of the 
light, so if we measure C(z) we can obtain the spectrum. If the spectrum is very 
narrow, as it would be in scattered laser light, the delays for which C(t) has to 
be measured can exceed 107° s, and then the autocorrelation can be 
measured directly by rapidly responding detectors which record individual 
photoelectrons. This technique is called correlation Spectroscopy, and it has 


been developed in recent years as a method of spectroscopy suitable for very 
narrow spectral lines. 


Problems 


6.1. In Young’s interference experiment the source pinhole and the receiving screen 


are each | m from the two secondary pinholes, and these are 1 mm apart. (a) 
What is the fringe spacing for light of wavelength 546 nm? (b) Estimate the 
maximum diameter of the source pinhole for fringes of good contrast to be 


formed. (c) What would be the effect on the fringes if the two secondary pinholes 
were not of equal size? 


6.2. In an experiment with a stellar i 
had zero visibility for a wavel 
separation of the mirrors of 3 
arcseconds. 

For the Michelson interferometer of Fig. 6.5, find an expression for the radius of 

the Nth circular fringe from the centre of the far-field interference pattern, in 

terms of f,, fz, and 2. 

6.4 Calculate the form of the fringe function in a Michelson interferometer for a 
Spectrum of rectangular profile. Plot a graph of this function for the case where 

the width of the spectrum in frequency units is 20 per cent of the mean frequency. 

The red line of cadmium (644 nm), as produced by a certain discharge tube, is 

found to have a coherence length of 200 mm. Estimate the width of the line in 

wavelength and frequency units. 

6.6. A reflection diffraction grating has rulings with 1 um spacing. Draw a graph of 
the angle of diffraction as a function of wavelength for the first-order spectrum if 
the illuminating beam is at normal incidence. 

6.7. In the arrangement of Problem 6.6, if the shortest wavelength to be used is 


200 nm, calculate the ange, i.e. the wavelength range without 


i d T spectrum. 
68. A dispersing prism has an angle of 60° 


nterferometer the fringes from the star Betelgeuse 
ength in the middle of the visible spectrum for a 
m. Estimate the angular subtense of Betelgeuse in 
6.3. 


6.5. 
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69. The prism of Problem 6.8 has refractive index 1.791 for the mercury line of 
wavelength 436 nm. Estimate the spectroscopic resolving power of the prism in 
this region of the spectrum if it is equilateral with a 20 mm base. 

6.10. Draw a graph of the fringe function of a Fabry-Perot interferometer with 
R =09, T = 0.1. Calculate (a) the finesse and (b) the minimum transmission. 

6.11. What is the spectroscopic resolving power of the Fabry-Perot in Problem 6.10 if 
the spacing of the plates is 5 mm? Calculate its free spectral range at wavelength 
500 nm. 


7. Laser light 


Glass flashing. That's how that wise man what's his name with the burning glass. Then 
the heather goes on fire. 


James Joyce: Ulysses 


For the purposes of this book lasers are simply sources of very intense, 
spectrally pure, and spatially coherent light. Everything we describe could, in 
principle, be done with ordinary thermal light passed through a monochro- 
mator of very high resolving power and then through a diffraction-limited 
pinhole. However, the light intensity would be so low that the experiments 
would in practice be impossible. Helium-neon lasers of the kind now 
commoner than sodium lamps in many laboratories produce about 1 mW of 
coherent monochromatic light of wavelength 632.8 nm, but the brightest 
available thermal source, an ultra-high pressure mercury lamp, would 


produce about 10 orders of magnitude less light power of the same coherence 
and monochromaticity. 


7.1. Laser beams 


The formation of the charact 
laser can only be fully explai 
e.g. Svelto (1976), but we cal 


eristic narrow intense beam of a helium-neon 
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of the mode, i.e. the shape of the laser light beam, is found by asking what 
distribution of complex amplitude across one of the mirrors will, on 
propagating to the other mirror and back again, reproduce itself so that a 
stable mode structure can be generated by the electrical discharge in the tube. 
The question may be rephrased, what distribution of complex amplitude will 
reproduce its shape on propagating to the far field, or in the terms of Section 
3.4, what function is its own Fourier transform? There are many solutions to 
this mathematical problem but it turns out that the simplest solution is the 
one which is appropriate to the laser under ordinary conditions. This is the 
Gaussian function, of which the modulus of the complex amplitude has the 
form exp — x(x? + y2), and it can be shown (Svelto 1976) that as the beam 
propagates the intensity profile always has the same shape, merely expanding 
or contracting laterally. This is shown in Fig. 7.1 where the arrowed lines 
indicate levels of constant intensity in the beam. After focusing by a lens the 


FIG. 7.1. A Gaussian beam. The arrowed lines connect points on the phase fronts 
where the intensity is 1/e? that at the centre of the beam. The phase fronts are plane at a 
beam waist. 


beam contracts to a ‘waist’, which is like the focus ofan ordinary beam froma 
‘point source’ of thermal light, and expands again. The phase fronts areshown 
in the diagram; at some distance from the waist they are spherical surfaces 
centred almost exactly on the waist, i.e. they are like geometrical wavefronts. 
Near the waist the phase fronts decrease in curvature and they are plane at the 
waist. We remarked in Section 2.2 that geometrical wavefronts do not 
Coincide with physical wave fronts near foci and Fig. 71 shows this clearly, 
since the geometrical wavefronts would shrink to a point at the focus, i.e. the 
beam waist. In fact the beams discussed in Chapters 2 and 3 have uniform 
intensity across the wavefronts until they are cut off sharply by an aperture 
stop or the rim of a lens and for such beams the form of the phase fronts near 
the focus is more complicated than in Fig. 7.1 (see Born and Wolf 1975). 


7.2. Coherent light speckle 

e is illuminated with laser light a 
d with a fine network of bright and 
n as laser speckle, although it can 
nt coherence. The same effect is 
h surface, e.g. ground glass, 


If an optically rough or scattery surfaci 
striking effect is seen: the surface is covere 
dark patches. This effect has become know 
also be observed with thermal light of sufficie 
Seen in the far-field pattern from a roug 
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illuminated with a laser beam as in Fig. 7.2. The explanation is that each point 
on the surface scatters a beam which is coherent with the beams from all the 
other scattering points, but there are random phase relationships between 
these beams, so that in the far-field we see a superposition of interference 
patterns between all pairs of scattering points. These interference patterns 
have random spatial frequencies, phases, directions, and contrasts, and the 
coherent sum of them all is the speckle pattern. The maximum spatial 
frequency in the pattern corresponds to interference between pairs of 
scattering points at opposite ends of a diameter of the ground glass. 


Le 


Laser beam _ Speckle 
Pattern 
Ground 
glass 


FIG.7.2. Formation ofa s 
all distances from the diffus 
at all distances. 


peckle pattern by a random diffuser. Speckle is formed at 
er, and the pattern has nearly the same statistical properties 


The explanation of the speckle 


pattern seen by looking at the surface of the 
scatterer rather than at the far-fiel 


Id is slightly different. In looking at a certain 
point on the surface we see the coherent sum of all the scattered beams within 
the radius of a resolution limit around that point, and this coherent sum may 
again have a Tange of intensities, depending on the random phases of the 
scattered beams. This argument shows that the scale of detail seen in the 


speckle pattern ona scattering surface Corresponds precisely to the resolution 
limit of the optical system used to view it. 
Coherent light s 


7.3. Holography 


In Section 3.1 we saw how two intersecting coherent collimated beams 
produce straight and parallel interf 


l 1 erence fringes. If the beams intersect at an 
angle 0 the fringe spacing perpendicular to the bisector of the angle 0 is 
a = }/(2 sin 40). If we record these fringes on a Photographic plate or other 
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recording medium by placing it in the beam as in Fig. 7.3 and then developing 
the plate, we have, in effect, a diffraction grating (Chapter 6). We next set up 
the grating with one of the beams switched off, say beam 2, as in Fig. 7.4, and 
we find several diffracted beams of different orders. The grating equation (eqn 
(6.7)) for a transmission grating can be written 


sin g — sina’ = Mio, (7.1) 
and in this case we have « = 0/2. Then for the zero-order diffracted beam 
(M = 0) we have «' = g, i.e. an undeviated beam, and for M = 1 we find 

sin a’ = sin 0/2 — d/o; 
but from the way we made the grating we have 7/ = 2 sin(6/2),so that for the 
first-order beam, 

a’ = — 6/2. (7.2) 
This means that the first-order diffracted beam travels in the direction in 
which beam 2 of Fig. 7.3 was travelling. Thus we have ‘reconstructed’ beam 2 
by illuminating the grating with beam 1. 


Another way of looking at this process is to say that in photographing 
fringe pattern we are attempting to record the complex amplitude of beam 2 


the 


Photographic 
plate 


Fringes 
throughout 
overlap 
rm fringes throughout the overlap 
pattern with this spacing. 


FIG. 7.3. Two collimated coherent beams fo n 
region, with spacing 44/sin 40. The plate records a grating 


sinusoidal grating produced as in 


FIG. 7.4. Diffracted beams from th i 
nstruction. 


Fig. 7.3. The first-order beam is the holographic reco 
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at the plane of the photographic plate. The photograph does not record 
unambiguously everything about this complex amplitude distribution. Thus 
the fringe spacing tells us that the phase changes by 27 every fringe, but we do 
not know in which direction the phase is increasing. This ambiguity can be 
regarded as the cause of the appearance of the other diffracted beams of 
orders 0, — 1, 2, 3, etc. On the other hand, if we had attempted to record 
beam 2 by placing the photographic plate in it without beam 1 to form 
fringes, we should have obtained merely a uniform blackening, i.e. a record of 
the intensity, and this could not contain any information about the direction 
from which beam 2 had come. 

Beams 1 and 2 can be regarded as originating from point sources P, and P, 
at infinity. Then in the terminology of holography we formed a hologram of P3 
with P, as reference, and we then reconstructed P, with the same reference 
point and using the same wavelength light. 

A similar thing occurs if P, and P, are at finite distances, as in Fig. 7.5. The 
fringes on the hologram are now curved, but it is again found that if the 


hologram is illuminated with P, alone then P, will be reconstructed (as in 
Fig. 7.6), and vice versa. 


FIG. 7.5. i i 
IG. 7.5. Formation of a hologram of a point P, with reference point. 


Hologram SS 


FIG. 7.6. Reconstruction of the 


hologra: a 
appears to come from the point gram of Fi, 


g. 7.5. Th us i m 
B; behind tte e first-order diffracted bea 


hologram. 
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Now suppose we have a reference point P, and an array of N object points 
P3, P3, . . . Py4,,and we record the interference pattern as before. The sets of 
fringes formed by interference between P, and each of the other points will all 
be formed, and, provided the recording medium has enough range, they will 
be superposed. Then on reconstructing with P, the image of the array of N 
Points will be formed. This is the principle of holography as invented by 
D. Gabor in 1948. Ultimately the array {P;} becomes a continuous distri- 
bution, and the hologram is a record of the interference pattern between the 
reference beam and the complex amplitude scattered from this continuous 
distribution. Let Eọ(x, y) be this complex amplitude as a function of 
coordinates (x, y) in the plane of the hologram plate, and let E,(x, y) be the 
complex amplitude due to the reference beam. The light intensity in the 
interference pattern is |E) + E,|?, and to a reasonable approximation the 
complex amplitude transmission T, of the developed hologram is a linear 
function of this, 


T, = ko — ki|Eo + El’, 


where kọ and k, are constants. In the reconstruction process the complex 
amplitude transmitted by the hologram is T,£,, and on multiplying out the 
Squared modulus we see that this is 


T,E, = koE, — k, E-EoE$ — k, EP ES 
=k, |E,’ Eo — k, EE). 


The intensity |E,|? of the reference beam is roughly constant over the 
hologram, since it comes from a point source at some distance. Thus the 
fourth term in eqn (7.3) is equal to a constant multiplied by the complex 
amplitude E(x, y) at the hologram due to the original object, and this term 
therefore accounts for the reconstructed image of the object. There are, 
however, four other terms in eqn (7.3) to be accounted for. It is easier to see 
the significance of these by assuming that the reference beam is collimated 
and that it meets the hologram plate at an angle of incidence 0, as in Fig. 7.7. 
Then the complex amplitude in the reconstructing beam, assumed to be ne 
same as in the reference beam, can be taken as exp {i2n/Ay sin 0}. 
Substituting this value in eqn (7.3) we obtain for the transmitted amplitude on 
reconstruction, after regrouping the terms, 

T,E, = {ko — ky(1 + |Eol?)} exp {i(27/4)y sin 0) 

— k, Ef exp {2i(2n/A)y sin 0} — ky Eo- 
ave travelling in the same direction as 
ent intensity; it corresponds to the 
d we show it as beam 1 in Fig. 78. 
nt 2i(27/A) sin 0, travels ina 
It can be shown to produce a 


(7.3) 


(7.4) 


Of these terms the first represents a W 
the reconstructing beam but with differ 
zero-order diffracted beam in Fig. 7.4 an 
The second term, on account of the expone: 
general direction 0' such that sin 6’ = 2 sin 0- 
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Reference 
beam 


FIG. 7.7. Forming a hologram of an extended object with an oblique reference 
beam. 


spurious real image as in Fig. 7.8, and it corresponds to the diffracted beam of 
order —1 in Fig. 7.4. The last term 


Reconstruction 
beam 


FIG. 7.8. Reconstruct: 
second term in eqn (7.4), 


ion of the holo 


Produces another į 
usually very distorted. Beams 1, 2, and Iü 


gram of Fig. 7.7. Beam 2, produced by the 
mage of the object. This image is real but 
orrespond to the three terms of eqn (7.4). 
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Beam-splitter 


Focus of 


Mirror 
laser beam 


(a) 


Positions 


(b) 
FIG.7.9. (a)A typical optical arrangement for taking a hologram. The laser beam is 
focused down by means of a microscope objective and allowed to diverge. (b) 
Reconstruction, showing that the image can be seen from a range of angles to give 
Stereoscopy. 


7.4. Hologram interferometry 


The holographic image is a reconstruction of the complex amplitude of the 


light scattered from the object with a particular geometry of illumination. 
Suppose that a hologram of an object is taken but that the object is left in 
Position and with the same illumination at the reconstruction stage. If the 
hologram plate is replaced exactly in its original position after development it 
will produce a virtual image exactly coinciding with the object. In practice 
there will generally be a slight displacement between them, as in Fig. 7.10. The 
light coming through the hologram scattered from the actual object is 
coherent with the light from the reconstructed, virtual object, and there can 
therefore be interference between these two beams. If the relative displace- 
Ment is small only one object will be seen but its surface will appear to be 
Covered by interference fringes which indicate the relative displacement 
between the original object and its reconstructed image. This is the principle 
of hologram interferometry. It can be done with optically rough surfaces, in 
fact it is best done with rough surfaces, since then the illumination and 
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FIG. 7.10. A reconstructed image (broken line) superimposed on the slightly 
displaced object. The displacement appears as a fringe pattern in hologram 
interferometry. 


viewing conditions are less critical. Classical interferometry, on the other 
hand, can only be done with smooth, mirror-like surfaces. 

Hologram interferometry is used in engineering and metrology for 
determining displacements and strains of surfaces in a variety of applications. 
The mode of formation of the fringes is intrinsically more complicated than in 
classical interferometry, as can be seen from Fig. 7.11. Each of the two 


Fig. 7.11. The optical path difference 
ferometry. The lo 


viewing direction 


x mapped by the fringes in hologram inter- 
cal displacement vector is d(=PP’), and the local illumination and 
S are specified by the unit vectors r and r’. 
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gure that the total path difference indicated by the fringe system near P is 
W =d'(r +r’), (7.5) 


pce is a rather complicated function of the directions r and r’ and the 
ti pee a we can measure a displacement of a surface in its 
eee 5 in Fig. 7.12, if the illuminating and viewing directions are 

bly chosen, whereas such a measurement would give a null result in 
classical interferometry with smooth surfaces. 


Rough 
surface 


F R x 
IG. 7.12. Measurement of in-plane displacement by hologram interferometry. In 
although this is not essential. 


th jee 

Th Paecugemient shown the surface is viewed normally, 

di e fringes map the quantity d-r, which, as can be seen, is non-zero for an in-plane 
isplacement. 


eferred to in Fig. 7.12 be a 


For example, let the in-plane displacement r 
ne about its centre. To find 


re rotation 5¢ of a disc of metal in its own pla: 
e form of the resulting fringes we have from Fig. 7.13a 


d = (p cos $5, p sin 5¢, 0) 


and we can take r as given by 
r = (0, sin 0, cos 4) 


the z-axis being perpendicular to the plane of the disc. Then if as in Fig. 7.12 
(7.5) for the path difference to 


a view normally to the disc we have from eqn 
e mapped by the fringes, 

W = p sin ¢ sin 050 (7.6) 

The fringes are loci of constant W so that if the illumination vector r is 

constant over the disc eqn (7.6) shows that the fringes are loci of constant 

4 in , i.e. they are straight, equidistant and parallel to the y axis, as in Fig. 

-13b. 
The form of eqn (7.5) suggests that the same fringe pattern could be given 
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(a) (b) 
FIG. 7.13. Hologram interference fringes from an in- 


plane displacement; (a) 
coordinates in a disc rotated a small angle 5¢ in its own pl 


lane, (b) the fringes. 


by different combinations of displacements and illuminating and voe 
conditions since W depends on three independent vectors, all of which cou 
vary over the surface being examined. Thus in our example of Fig. pi nt 
could get the same fringe pattern if, say, the rotation od were doubled an 
sin 0 (for the illumination vector) were halved. Again, the same fringe pattern 
would result if, with the same illuminating and viewing conditions, the 


displacement were changed from an in-plane rotation to a small wedge, i.e. if d 
were given by 


d = (0,0, ex) 
where £ = 5¢ tan 0. This indeterminacy, 
interferometry with specular surfaces, can 
at least two interferograms with different 
Sometimes by making use of other infor 
displacement to be expected. 

Hologram interferometry can be done in different ways. A variation of the 
thod described above is to take two holograms on the same plate, one 
before and one after the displacement. The object is then removed and fringes 
reconstructed images; this is called the frozen 
iation a hologram is taken of a vibrating surface 
with a time exposure lasting for many periods of the vibration. The 
reconstructed image carries fringes showing the form and amplitude of the 
mode of vibration. 


which is not a feature of classical 
in practice be overcome by taking 
illumination or viewing angles, or 
mation about the kind of strain or 


75. Holographic diffraction gratings 


Laser light 115 


e 


ea Photo-resist 
fe Lor 


(a) 


Aluminium 


Zz 


(c) 
FIG. 7.14, _ Making a holographic diffraction grating. (a) A glass blank coated with 
Photo-resist is illuminated with crossing laser beams to form the fringes. (b) The photo- 
resist is developed to give a contoured surface. (c) An aluminium reflecting coating is 


applied, 


aluminizing the resist a reflecting diffraction grating is formed. Techniques 
are available for shaping the groove profile to produce a blaze, i.e. to direct 
most of the light into a single order of diffraction. Also the method can be 
applied to gratings on curved surfaces, and by careful choice of geometry 
better image formation can be obtained than in conventionally ruled 
gratings. 

Hologram interferometry and the manufacture of 
Probably the most important practical applications of 


diffraction gratings are 
holography at present. 


7.6. Spatial filtering 


We saw in Section 3.4 that the complex-amplitude distribution in the far-field 
diffraction pattern of an aperture with complex amplitude variations in it is 
the Fourier transform of these complex amplitude variations to a suitably 
chosen scale. This could be realized experimentally as in Fig. 7.15. A 


transparency placed in an aperture at the front focal plane of a lens is 
illuminated with collimated coherent light. The far-field diffraction pattern of 
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Fourier- 
transform 


n lens y 
Collimated i : 
beam č 
f ———- f 


Far-field plane 


Transparency _ 


FIG. 7.15. Optical transforms. The complex amplitude in the farfield is the Fourier 
transform of the complex amplitude at the object transparency. 


the aperture and transparency is formed at the other focal plane, and from 
Chapter 3 the complex amplitude in this plane is 


ca 


2 
E(x, y) = i T,(é, n) exp— fa (x+ mm} dé dn, 


-œo 


or, putting 


x/Af =s, y/Af = t, 


E(s, t) = Í f T,(č, n) exp — 2ni(čs + nt) dé dn. (7.6) 


Here s and t are spatial fre 


an quency components of the complex amplitude 
transmission T,(é, n), just as 


A PE ace we defined spatial frequency components of an 
intensity distribution in Section 5.5. Thus the complex amplitude at (x, y) in 


the Fourier plane is proportional to the amount of complex amplitude with 
Spatial frequency components (x/4f; y/Af) in the original transparency. TO 


take a simple example, Suppose the transparency is a sinusoidal amplitude 
grating of complex amplitude transmission 


T,=1+ cos 2nsoč. 
If we write this in the form 


Ourier transform consists of delta functions of 
nigin and at (+59, 0). Thus in the Fourier plane 
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lee we The transform of a grating with amplitude transmission of sinusoidal 
. If the grating spatial frequency is sọ the + 1 and — 1 orders are ¿fso from the zero 
order in the transform plane. 


s Fourier Filtered 
Object plane image 
—S > —— /—_>= S — < [— 


AG: 7.17. A spatial filtering apparatus. The filters are placed in the Fourier or far- 
eld plane, and the filtered image appears at the right. 


object is imaged at infinity in the intermediate space; this is found to be 
broadly correct. However, if we were to obstruct part of the Fourier plane we 
should be removing some of the spatial frequency components, and the image 
would appear changed accordingly. In the above example, the intensity 
distribution in the object transparency is |7,|’, ie- 


Io(č) = 1 + 2 cos 27so¢ + cos? 27So¢ 


= 34 2 cos 2msoé + $ cos Ansoé, 
se that we put a filter or mask in 
der component, i.e. the bright 


(7.8) 


and the image would be the same. Now suppo! 
the Fourier plane which removes the zero-or 
Spot Po. The complex amplitude in the image will then be 


Ti=c0 27506, 
and the identity will be the squared modulus of this, 


E =4+ $ cos 4nsoé. 
Comparing eqns (7.8) and (7.9) we see that there has been a complete change 
in the appearance of the image. Remembering that what we see or detect is 
intensity, not complex amplitude, eqn (7.8) represents a periodic structure 
with basic spatial frequency sọ (but also with a harmonic of frequency 2s), 
Whereas in eqn (7.9) the basic frequency is 2so. Figure 7.18 shows to scale the 
relative intensities represented by eqns (7:8) and (7.9). , . 
Ernst Abbe explained these effects physically 100 years ago in developing 


(7.9) 
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(a) 


(b) 


my a , ofa 
FIG. 7.18. Images in coherent illumination: (a) the light intensity in the eri tke 
sinusoidal complex amplitude distribution, as given in eqn (7.8), (b) the intensity 


h lex 
image of the same object but with the zero spatial frequency component of comp 
amplitude removed, 


corresponding to the angle between 


P,). If Po is removed the interference 
is doubled, since i 


P,. 
Abbe also Pointed out that if PL 
interference, merely uniform illumin, 


sin «> Aso. 
Thus the minimum Tesolvable Separation I/sq is 


0 
1/so = Asin a (7.1 ) 
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ie Sal ee in complex amplitude.+ This may be compared with 
= coast ie to incoherent illumination. The functional dependence 
eee aperture is the same, but the proportionality constant is 
Peis to spatial filtering, it is possible to put at the Fourier plane a 
pk te ngo either the phase or the amplitude of certain frequency 
2 A any desired way. As a simple example, the dot structure in 
oe p 5 ped be filtered out in this way. If a transparency of, for 
coe x sete ria bn otograph has prominent features running parallel to a 
eee on,t ese may be filtered out by means of an opaque strip across 
i urier plane at right-angles to the direction. This device may make it 
easier to see other features. 
der tae, E stage of Fig. 7.17 can be regarded as taking the inverse 
lace Th nsform of the complex amplitude distribution in the Fourier 
us from eqn (7.6) the final image is the original object, 


T,(é, n) = f E(s, t) exp {2ni(sé + tn)} ds dt. (1.11) 


æ 
æ% 


If ; e 5 
we differentiate this relationship with respect to ¢ we obtain 


aT, 
= ? -Í 2nisE(s, t) exp {2ni(sé + 1)} ds dt, (aia 


E that the derivative of the original is obtained by 
ourier plane with complex-amplitude transmission 


putting a filter in the 
2zis or, changing back 


Intensity _ 
transmission 


n phase shift 


FIG. 7.19. A differentiating filter. 


to the actual coordinate in the Fourier plane, 2nix/ Af. This is a linear variation 
of amplitude transmission across the x-direction. 
nstant phase shift over the 


The factor i is not important, since it implies a co! 
ing the axis this implies a phase 


whole plane, but since x changes sign On Cross 

shift of z. Thus the filter would be as in Fig- 7.19 with a quadratic intensity 
f Note, however, that because of the process 0 of taking the squared modulus, a given 
Spatial frequency in complex amplitude may be doubled in intensity. 
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transmission across the x-direction and a 2/2 film across one half. Many other 
devices of this kind are possible for analogue computing in the Fourier 
domain. 


Problems 


7.1. A piece of ground glass of diameter D is illuminated uniformly by a laser beam, 
and a speckle pattern is formed on a screen at a distance L. Show that the smallest 
detail in the pattern is of order of size AL/D. If the screen is viewed from 250 mm 


and if L is 2 m, how large must the ground glass be to make the smallest detail 
unobservable to the eye? 


7.2. In a holography experiment the reference source and the object are both 1 m 


from the hologram plate, and they are 100 mm apart. Estimate the scale of detail 
in the hologram fringes if the wavelength is 632.8 nm. 

7.3. A collimated reference beam is used to form a hologram of an illuminated 
pinhole. Sketch arrangement and discuss the form of the hologram fringes. 

7.4. In a hologram interference experiment with a helium-neon laser the surface 
under test is illuminated and viewed at normal incidence. If the displacement to 
be determined is 2500 nm, how many fringes will it be represented by (a) if the 
displacement is normal to the surface; (b) ifit is at 45° to the normal; and (c) ifit is 


in the plane of the surface? In Case (c) suggest a way of improving the sensitivity of 
the technique. 


A plane diffraction grating is to be 

velength 632.8 nm. Sketch the arr. 
required angles if the grating is to havı 
closest grating spacing which could 


Ta: produced holographically using light of 
angement to be used, and calculate the 
e 1000 rulings per millimetre. What is the 
be made in this way? 


8. Optical light guides 


nei a rod toa thin fibre in a bunsen burner flame and break the 

Ri : bre from the thicker parts. If one end is applied to a light source 

Tie a lamp bulb or a laser the light is conducted down the fibre and it 

ae brightly at the other end. This simple experiment illustrates the 

decd wo modern technologies. In this chapter we shall apply the ideas 
veloped in the earlier parts of this book to such light guides. 


8.1. The acceptance angle of a light guide 


The mechanism of transmission of light along a guide is, in the geometrical 
aao approximation, total internal reflection as described in Section 2.2. 
igure 8.1 shows a cylinder of material of refractive index n; if a ray of light 


Aea 


SSS 
-section; the light is confined to the 


FIG. 8.1. A solid light guide of cylindrical cross 
guide by total internal reflection. 


ident on the wall at an angle greater 
= 1/n it will be totally internally 
de is straight until it reaches the 


travelling in a plane through the axisis inc’ 
than the critical angle I, given by sin L 


reflected; it will carry on in this way if the gui rea 
end of the guide. Rays in planes through the axis are called meridian rays; 


skew rays do not lie in an axial plane but equally it can be seen that a skew ray 
once reflected at an angle greater than the critical angle will carry on in a path 
with successive points of incidence lying ona helix on the surface of the guide, 


as in Fig. 8.2. 
It is essential to have a very clean perfectly polished surface without 
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=o a 


FIG. 8.2. A skew ray propagated inside a light guide. 


scratches on glass to achieve total internal reflection without losses. bis es 
not a very serious worry with prisms such as that shown in Fig. 2:5; TT a 
light is only reflected once and small losses do not matter, but in a rib T 
guide a given ray may be reflected very many times, so that losses due 

surface defects must be minimized; the experiment described at the Seige 
of the chapter will illustrate this because specks of dust and scratches on $ e 
fibre will appear bright when light is transmitted through it. Such ae 
losses are prevented by a protective coating or cladding, as in Fig. 8.3, 0! 


(m) 
ENE o) 
a7 s d ‘ 
we 
L 


FIG. 8.3. A protective cladding for a li 
reflected at the interface between the 
indicates the refractive index variati 


n 


ght guide; a meridian ray is totally internal 
guide and the cladding. The diagram on the le! 
on across a diameter. 


refractive index sa 


y n2 which is lower than the index n, of the guide. Then 
there will be a cri 


tical angle I, given by 


sin I, = n/n, (8.1) 
for total internal reflection at the interface and damage to the outer surface of 
the protective layer will not matter. 


Figure 
This is th 


sin 8 = (n? — n}). (8.2) 
This quantity is called the n 


umerical aperture of the light guide, by analogy 
with the use of the same t 


erm for image-forming systems such as the 
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a — 5.4). The above argument was developed for a meridian 
knee Sain ae t at any skew ray incident on the end face at an angle less 
pe cia a a a Thus all rays entering inside a solid angle x sin? 0 
kena ; . and the numerical aperture can be taken as a measure of 
es a ne the guide to transmit light power. For example for a core 
nts ex of 1.52 the cladding index might be 1.48, giving a numerical 

3 üre 0.35, or an acceptance angle @ of 20°. 
chp sae some reservations should be made. First, it can easily be seen 
eee ae poy by eqn (8.2) is the limiting angle for meridian rays, yet 
emer ie A greater angles to the end face can be transmitted, 
eide eir istance of closest approach to the axis of the fibre. Thus 
erie u transmit more light flux than might be supposed from the 
KER i numerical aperture. Secondly our argument has been in terms of 
eee tains and while this is accurate enought to predict the main 
pia guides with lateral dimensions which are large compared to the 
eee va the light it will not serve for smaller guides; for these a 
ee ased on electromagnetic theory would be needed (see, e.g. 

er 1979), 
vies now to the properties of the guide we see t 
rce is focused onto the end of the guide as in 


Source 
image 
/ 


1 
i 
y 
7 
i 
i 


hat if the image of a 
Fig. 8.4, so that it 


Light guide 


the source should be at least as 


F £ 
IG. 8.4. Illuminating a light guide. The image of 
uminating beam should 


l i 
eke as the end of the guide and the convergence angle of the ill 
least as great as the acceptance angle of the guide. 


ence angle of the light from the 
f the guide, then the guide will 
t can. Let the radiance of the 
ambertian radiator; 


i . 

i a all of the end, and if the converg 

S ns is greater than 0, the acceptance angle © 
ccept as much light flux from the source as i 


Nio (Section 2.6) be B W mm~? sr” 1 and let it be a L 
is means that the flux emitted in a direction making an angle ¢ with the 


Surface of the source is proportional to Cos ġ; many thermal sources radiate 
approximately in this way but not, of course, lasers. If the diameter of the core 
of the guide is 2a then the flux collected by the guide is, neglecting the extra 
TA rays mentioned above, and neglecting also reflection losses at the entry 
ace 


n?a?B sin? 0 W; (8.3) 


this expression is proved in Problem 8.2. 
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8.2. Graded index guides 


The clad guides described in Section 8.1 can be of any diameter, depending on 
the application, and in fact some are made as rigid rods several millimetres in 
diameter. For optical communication purposes (see Section 8.4) there are 
advantages in having very small effective diameters, i.e. comparable with the 
wavelength of the light, and these have a different structure from the guides 
described in Section 8.1. The graded index guide is a cylinder in which the 
refractive index decreases smoothly from the axis radially outwards. The 


refractive index distribution may be represented, as in Fig. 8.5, by an equation 
such as 


n=Nny—ar? (r<ro) (8.4) 


n=Ng—ar§ = (r>r9) 

The geometrical optics of a graded index fibre of this kind is more 
complicated than for the simple fibre of Fig. 8.3; these latter may be called, 
correspondingly, step index fibres. However, if we consider rays in a meridian 
or axial plane we can see how a guiding effect is obtained. Consider in Fig. 8.6 


FIG. 8.5. Refractive index profile of a graded index light guide. 
-oc 
- ee a 
_ í _ P 
FIG. 8.6. 


Focusing effect of a graded index guide. 


A eee PP’ oflength Isay; the optical path length from P to P’ along 
indicated ea rom eqn (8.4). Now consider a second path from P to P’ as 
y the double arrows; the geometrical length of this path is greater 


than | ; 

than ue bah Sy through a region of lower refractive index 
$ > Is possible Sl Š ith 

the increase in geometrical di that the decrease in index combined wit 
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FIG. idi 
8.7. Path of a meridian ray in a graded index guide. 


be obje i i 
a se ino image, since the optical path length from P to P’ is constant for 
pea a certain angle. We can quantify this as follows. 
sates Fie E above a meridian ray should follow a periodic 
ler . 8.7. Let the length of each half cycle be l; then we can represent 
r = a sin (7z/l), (8.5) 


where z is i H 
the axial coordinate and a is a constant for the particular ray 


ch 
osen. An element of length ds along this ray is 


ds = /(dz? + dr?) 


Thu: i 
s the optical path length along the ray is, from Section 2.2, 


I 2y1/2 
f ds = ll (no — ar?) fi + (7 cos z) } dz. (8.6) 


K o 
A let a be small so that only rays cl 
sp 3 specifically, we neglect powers ofa 
ute for r in eqn (8.6) from eqn (8.5), expan 


to obtain 
$ az na\? nz 
fn as [ (ms — ga? sin? “fi $ (7) cos” z) dz. 


o 


ose to the axis, i.e. paraxial rays, are 


above the square. Then we can 
d the square root and simplify 


Then wang ca eaa 0 ibaa wee obtain, on 


again neglecting a’, etc. 


1 2 2 2 
[n ds = f(r + oer - i F afi gi a cos z) di G 


o 
aS be seen that when the integration is carried out the third term will 
al at both limits, since it gives sin 0 and sin 7. The first term will give 
(8.5) ly nol, the optical path length along the axis. Now if the trajectory of eqn 
-5) is really a physically possible ray of those rays which form an image of P 
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i i.e. the 
at P’ the whole integral must be constant for varying a and equal to nol, i 
second term must vanish, or, 


2 
Nok” a 


4? 2° 


: $ ee ions of 
This therefore must give the distance between successive intersecti 


the axis: 
Tae (ea (8.8) 
2a)” 


ays 
The above analysis applies only to meridian rays and the theory for “a 
which are skew to the axis is considerably longer. In fact it is found tha 


; ; : ay it 
graded index fibre will confine skew rays to a region near the axis and thus 
acts in a way similar to the Step index fibre. 


The limiting acceptance angle for a 
which the ray just penetrates the regio; 
entering at the axis this can be foun: 
index as a sequence of thin Shells o 
applying Snell's law we find for the 


meridian ray can be taken as that for 
n of uniform refractive index. For a ray 
d by treating the continuously varying 
f uniform index, as in Fig. 8.8. Then by 
acceptance angle 


sin 0 = Vin? = (ny) - ar2)?) 


FIG. 8.8. The li s is 
lated to be parallel to the axis when i d index fibre; the ray 
aaa is —arg. en it reaches the region of uniform refractive 


miting acceptance angle @ f 
or 
a grade 


8.3. Light guides for image transport 


There are three main groups of ap 


I ' applications for 
transporting light simply for illuminati. 


: light guides, (a) for 
ORE Tor transporting images; and 
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fhe pain ese channels. The first application needs no discussion as 
ee pan is obvious; in this section we explain the use of guides for 
pane 3 images. If a bundle of fibres is made as in Fig. 8.9 and a real 
A em onto one end then it is obvious that, provided the 
i, nai ae fibres is kept the same at both ends, the image will appear 
Pe te end of the bundle. Such fibre bundles are made either flexible or 
Pie second case the individual fibres, usually of step index form, are 
together as a solid glass rod. 


> 


F 
IG. 8.9. Image transport by a fibre bundle; the diameter of the individual fibres is 


not shown to scale. 


brightness the convergence angle « 
limage should equal the acceptance 
For commercially made fibres this 
10° and 30°; the latter value 
F/1. This leads to applications 


In order to get the maximum image 
from the lens which projects the initial real 
angle of the fibres, as given by eqn (8.2). 
acceptance angle ranges between about 
corresponds to a lens with aperture ratio 
where large light collecting power is essential. 
Consider, for example, an oscilloscope used for recording a single very 
rapid event detected as, say, a voltage pulse lasting for 107° s. The trace on 
the oscilloscope screen must be photographed and a camera lens of very great 


light collecting power is needed. A typical arrangement would be as in Fig. 
the necessarily thick faceplate of the 


8.10, which shows the phosphor inside 

cathode ray tube, the camera lens, and the film. It is difficult and expensive to 
obtain a camera lens with a large enough collecting angle to give an adequate 
exposure for rapidly occurring single traces and this has led to the 
development of the fibre-optics faceplate. Figure 8.11 shows a plate 
composed of short step index fibres vacuum sealed in a matrix of black glass; 
this plate is used as the faceplate of the cathode ray tube and the phosphor is 
deposited on the inner surface. Light from the electron beam trace on the 
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een l A 
Filectron be Photographic 
Be film 


Faceplate of 
cathode ray 
tube 


FIG. 8.10. Taking a photograph of a single trace on an oscilloscope screen. 


FIG. 8.11. Structure of a fibre-optic face-plate for a cathode ray tube. 


of photographic film pressed against the outs} ; 

A t S 2 # 

this system the light flux collected Loo nce will gies ne With 

proportional to the square of the numerical a a of the phosp On 18 

bya factor A which allows for the Packing TEA ee fibres multiplied 

or other image-forming aberration and the Tesolvin, s. There is no distortion 

the spacing of the fibres. For example, let the individ ee depends only on 

in a honeycomb pattern at centre spacing d and let z bre axes be arranged 

index core of each fibre be 2a. Then it can be seen date se? of the high 

packing is e factor to allow for 
2n 

A =—— (a/d)?. 

v6) 3 (8.10) 
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Pa baa oe power is indicated in Fig. 8.12; a bright narrow line 
i ne side of the face plate is broadened to a width which may be as 
ge as d+ 2a or as small as 2a, depending on the position and orientation of 
the line relative to the fibres. 
as, ee principle used in the fibre optics face plate is applied in 
peas a or example as a way of eliminating field curvature in 
Ca al lens systems and for transporting images through narrow tubes 
long distances for, e.g. inspecting inaccessible cavities in machines and 
chemical apparatus. 


-optic face-plate; the line AA’ is broadened 
In practice the arrangement of the fibres 
broadening would be irregular. 


HG; 8.12. Resolving power of a fibre 
ore than BB’ since it overlaps more fibres. 
would not be as regular as in the diagram and the 


8.4. Light guides for communication 


It was first suggested in 1966 that step-index light guides should be used for 
transmitting signals, i.e. as communication channels. The principle is, with 
hindsight, obvious enough, to send light down the fibre with the intensity 
modulated in time to give either a digital or an analogue signal. However, it 
was only at about that time that light sources and detectors began to be 
available with characteristics suitable to make such a system competitive with 

In order to understand the reasons for 


electrical communication along wires. rstal 
this we have to introduce some ideas about communications. 


Sounds are heard at frequencies between approximately 50 Hzand 15 kHz 
and they are transmitted as analogue signals, i.e. variations of voltage 
proportional to the instantaneous sound pressure, along telephone wires; at 
suitable intervals there are amplifiers, called repeaters, to keep the signal 
strength up- There would be no difficulty in modulating the output of a light- 
emitting diode (LED) of the kind used in hand calculators for example, at 
frequencies in this range, sending the light down a fibre light guide, detecting 

itha photoelectric detector and then converting the electrical signal into 
3 e usual way- However, this would be much more expensive and 
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complicated than ordinary telephony. To see how light guide transmission 
could be competitive we note that sound covers a range of frequencies of, say, 
15 kHzand this can be transmitted not directly in this frequency range but as 
a modulation of a higher frequency, the carrier frequency. Thus in ordinary 
amplitude modulation (AM) radio the carrier may have a frequency of, say, 
1 MHz and the amplitude of this carrier would be varied according to the 


audio signal. For a carrier frequency vo and a single audio tone of frequency V 
the modulated carrier takes the form 


V(t) = Vo(1 + m cos 2zvt) cos 2zVvot. (8.11) 


In this equation m represents the strength of the modulation. Equation (8.11) 
can be rewritten in the form 


V(t) = Vo cos 2nvot + 4mVo[(cos 2x(v + vo)t + cos 2n(v — vo)t)] 


or in complex notation, using the convention explained in Section 1.3 that 
only the real part is to be taken, 


V(t) = Voe ivo 4 mV (e7 27H + vou 4. e7 ritv- volt) (8.12) 
We can now use our Fourier transform ideas again and say that this signal 


is represented in the transform domain, i.e. the temporal frequency domain, 
by 


Vv) = VYod(Vo) + 4mV(5(v + vo) + 5(v — Vo) (8.13) 


and this is shown in Fig. 8.13. The audio signals can occupy a relatively small 
range of frequency covering perhaps +15 kHz on either side of the carrier 


Carrier 
Fw 
0 wy My vyhy 
FIG. 8.13. A carrier wave of fre i 
. quency vo, ii 
shown in the frequency domain. Y Yo, amplitude modulated at frequency Y 


frequency Vo- This range is called the bandwidth. The idea of a bandwidth of 
temporal frequencies needed to transmit certain kinds of signals is paralleled 


exactly by the ideas introduced in Chaj x : 
pter 5 of a range of encies 
needed to transmit certain ge of spatial frequ! 


ed t detail in forming the image of an object. For 
pipes a bandwidth of order 10 MHz is needed, as a easily ks seen by 
estimating the number ofindividual points in a 625 line television picture and 
recalling that the picture is reproduced 25 times a second; the used carrier 
frequencies range up to about 1000 MHz, but this probably represents the 
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racts H i 

P Gahr commercial radio communication. The number of different 
Pepa ais tee copies which can be broadcast simultaneously in the 
som keien Abi un ; y plotting their carrier frequencies and bandwidths 
tesa te iPass asin Fig. 8.14; if neighbouring bands do not overlap 
ei herent separately without interference or crosstalk, provided the 
uned to pick up only the band of frequencies corresponding to the 


vo v 


FIG. 8 Frequency 
» 8.1.4. i A i ant 
bandwidths: Carrier frequencies vo and Vo with their accompanying required 


eral telephone conversations can be 
e line by modulating different carrier 
a telephone line is a 
be carried 


a programme. Similarly sev 
iden simultaneously over on 
few eee for each; the maximum carrier frequency for 
: z, so a few hundred separate telephone conversations can 
simultaneously. 
ae to light guides, it will 
ERES requencies in the range 4.3-7.5 x 10 
about aes frequencies in this range a sing 
anaes separate telephone conversations or 
Skak 7 simultaneously? The reality is, so far, not qui 
fies as said above it can be seen that the range ofc 
A bandwidth are two key ideas in discussing 
ci ue three further questions which arise in const 
sine munication channels are (a) can We have enough a i 
ver the frequency range of the light; (b) cana light guide transmit the signals 
without degrading them enough to make them unrecognizable; and (c) could 
the signals, if undegraded, be separated again without crosstalk? The answer 
to all these questions is “No’ at present. Nevertheless, light guides do offer the 


oe of substantial advantages he number of separate 
ignals carried simultaneously, as Wel to be mentioned 
below. 


Consider first the spacing of the carrier frequ arat 
carriers we should require the frequency ofeach carrier to be stable to within 


about +1 MHz: this is attainable by the most elaborate stabilization 
methods for a few gas lasers but by no means for 10° separate and arbitrarily 
chosen wavelengths in the visible spectrum. Equally, even supposing all these 
separate highly stabilized carriers could be obtained they would have to be 


be recalled from Chapter 1 that visible 
14 Hz. Thus by selecting suitably 
le light guide might transmit 
10® separate television 
ite so spectacular: from 
arrier frequencies and 
g any communication 
dering light guides as 
close-spaced carriers 


in increasing t 
| as in other ways 


encies; for 108 separate 
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sorted out at the receiving end of the light guide (or at intermediate co Pe 
or amplifying stations) by filters or spectroscopes of resolving power he i 
at present quite unattainable under the required working conditions. ae 
practice there are two possible systems, one being to modulate the lig ey T 
at a reasonable frequency, say 10° Hz, and use this frequency as the carrier; 
several carriers with suitably spaced frequencies could be used to cany 
simultaneously different communication channels. The second possibility a 
to use digital transmission and this is the method which is being most ae 
explored at present; in digital transmission the analogue signal, soun! 4 
television or whatever, is pre-coded into digital form by an analogue-to 
digital converter. This will, for example, record the light intensity at a en 
point in a picture as being any of, say 64 different values in the range 0, 
1,2,...63 and encode it in the binary scale as a 6-bit number. A similar 
Process is carried out with sound intensities, sampled at sufficiently close 
intervals of time; the signal is then transmitted as a series of Os and 1s, as in a 
computer, represented as dark and light periods of the LED at the 
transmitting end of the light guide. Rates of modulation and detectan 
approaching 10° bits per second are possible with available LEDs an 

photodetectors, which corresponds to an available bandwidth of about 
1000 M Hz; this is still very large for a single light guide even if it is 
disappointing compared to the actual frequency of the light. We thus have to 
ask whether a fibre could transmit at such frequencies. In other words, if a 


single bit representing a 1 enters the fibre as in Fig. 8.15(a) will it emerge Ae 
(b), in which case no great confusion will occur, or as in (c), when it would be 
(a) 
1. Time 
2 
$ 
E 
-~ 
t 
(c) 
` = 
FIG. 8.15. _ Degradation of a light pulse representing a single bit on transmission 
through a light gui 


ide: (a) input Pulse, (b) slightly degraded, (c) degraded unusably. 
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ated bit rate. unication channel at the 

We can formali bi 
ime echt einai terms of the concepts of Section 5.5, but with one 
to be a delta function of re. ee ere So 
ouipatkon hel eitea iey iee radi itely short burst of light, and the 
analogue of the point ree spread in time into the impulse function, the 
of the impulse functio spread function for an optical system. The time spread 
anemiei wither’ i gives a measure of the duration of signal which can be 
the impulse function ne ste i? ee 
i ; 15) i F Cen 
a ka diferent Requencies ate d = ar ii which shows how periodic 
for a light othe parse factors to the size of the impulse function 
tnderioodie so e rst, known technically as modal dispersion, can be 
pataxiAl meridian SO [geometrical optics:in Section 8.2 we found a path fora 
could be ates in a graded index fibre and it was mentioned that there 
following eal si or other rays, i.e. the optical path lengths along the fibre 
ue bir, anaa mh not the same; but the optical path length along a 
so'that parts ota TA taken for light to travel the ray path (Section 2.3), 
times, prodiciñg -a ignal travelling different ray paths will arrive at different 
material dieren man out impulse function. The second factor is simply 
PAE wate n, the fact that the refractive index of the fibre material 
Since the light ength, i.e. the speed of a light pulse varies with wavelength. 
again AR used in practice has an appreciable spread of wavelengths this 
Tke an to the width of the impulse function. 

Maaa ird feature of light guides which is crucial to their performance, 
ation. In a light guide only several metres long there may bea 


seriou . 3 PN, 
s loss of light by absorption and scattering in the material of the guide 
uide which can be used as 


andit is this whi 

PEE which decides the maximum length of guide v 

expressed agrees link without a repeater to restore the signal. Attenuation is 

meor Sak ecibelst per unit length. The best optical fibres obtainable at the 

Klote TE are claimed to have attenuation as low as one or two db per 

a rain attenuation varies with wavelength and for most fibres there is 

infrared m of attenuation ata wavelength of about 1.3 um, i.e. in the near 
region. Figure 8.16 shows, with some detail omitted, a typical 


attenuation curve. 

RAS si suggests that to min 

State lase råred:sonrce pinaraan iti 

Er rs of different kinds which emit 1 

relativel no clear best choice. Solid-state 

mieri arene cape ne chs 
ispersion in a fibre is likely to 


of attenuation we should 
sent both LEDs and solid 
frared are used but there 
able design emit over a 

so that the effect of 
ereas LEDs emit over 


imize the effects 
h 1.3 pm. At pre 
n the near in 
lasers of suit 
about 2nm, 
be small, wh 
n by a factor 0.1, 


at 10 db is an attenuatio! 


TA deci a 
ecibel is a logarithmic unit such th t 
o attenuation 1070" 


20 db = 
b = 0.01, and generally n db corresponds t 
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100 


Attenuation db km 


04 06 08 10 12 14 16 
Wavelength (um) 


FIG. 8.16. Attenuation curve for a fibre light guide; the curve is drawn to show we 
general trend for many fibre materials but it does not correspond to a particu 
material. 


perhaps 50 to 100 nm. On the other hand LEDs are cheaper, simpler to run, 
and more reliable than lasers. 


The effect of attenuation is to limit the length of fibre which can be used. 
Figure 8.17a suggests a typical digital light signal of the form 101001 to be 
transmitted; ignoring the effect of the impulse function of the light guide, we 
should naively expect the output voltage from the photoelectric detector at 
the exit end to have a similar form. However, in a quantum detection process 
(Section 1.4) the Output will actually consist of individual pulses correspond- 
ing to the detection of individual photons and if the signal has been 
attenuated in the fibre the pulses might appear individually as in Fig. 8.17b, 
i.e. we should have å noisy output. Figure 8.17b does not show only the signal 
photon noise, as it is called; we have indicated also some noise at the tims 
corresponding to zeroes in the signal; this noise is from dark current in the 


photodetector, i.e. stray electrons produced when there is no light, and from a 


variety of other sources of noise. Figure 8.17 illustrates the concept of Pi 
signal-to-noise ratio, i.e. the ratio of the mean output signal to the mean (0! 


C1 C] E (a) 


Signal 


1 


Put signal 101001, (b) the output signal from 
se from various sources, 


FIG. 8.17. The effect of noise: (a) the in 
the photodetector, showing noi 
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ctly root-mean-square) noise. early with a si -to-noi i taht 
too unfavourable 1s will sometimes be taken tri, <a pee pence S 
make a very simplified calculation as follows. Aiii 
. Let W be the light power in watts corresponding to the signal 1, let this 
signal last for time t and let the mean frequency ofthe light be v. The energy in 
a single 1 is Wt and this therefore contains Wt/hy photons on average, Thus 
1 a photoelectrons will be produced on average for each 1, if ņ is the 
ri ER detector. Now it is known that the mean-square fluctuation in 
be tak rd of photoelectrons produced in a given time is equal to the mean 
peau e root mean square variation in the number of photoelectrons 
uced for a 1 will be (n Wt/hv)!/? and the signal-to-noise ratio is therefore 


1/2 
S/N -(F) : (8.14) 


hy 


In this calculation we have ignored several other sources of noise. 


However, eqn (8.14) shows the essential features that the signal-to-noise ratio 
is proportional to the square root of the signal strength W and inversely 
Proportional to the square root of the bandwidth (1/t). We should note also 
that as given by eqn (8.14) the signal-to-noise ratio refers to current or voltage 
output from the photodetector. For some purposes the ratio of signal-power 
to noise-power obtained from the photodetector is used and then the square 
of the expression in eqn (8.14) would apply. Either way it can be seen that the 
effect of attenuation in reducing the light power W is to reduce the signal-to- 
Noise ratio at the output from the fibre and a point is reached at which the 
digital signal must be refreshed by detecting the light signal, re-shaping it 
electronically and re-transmitting it for a further stage. The distance between 
such repeater stations depends on the fibre attenuation and, of course, on the 
strength of the input signal. This introduces the concept of coupling light 
guides to light sources and detectors, briefly touched on in Section 8.1. 

; We saw in Section 2.6 that the light transmitting power ofan optical system 
is proportional to the square of the Lagrange invariant, a quantity sometimes 
called the étendue. For a step-index guide the Lagrange invariant is the 
product of its numerical aperture (Section 8.1) and the radius of the core. 
Then in order to couple a light source as efficiently as possible to the guide we 
must ensure that all of this étendue is used, ie. that rays from the source enter 
at all angles up to the full acceptance angle and at all parts of the core cross- 
Section. If the source is a light-emitting diode with a luminous area much 
greater than the core diameter this can be achieved simply by butting the 
source against the end of the guide. If there are objections to this because, for 
example, the source may be encapsulated in a glass or plastic cover we have to 
use a condenser. This is a lens or other image-forming system; which focuses 
an image of the source onto the end ofthe guide, asin Fig. 8.18. The lens must 
havea large enough diameter and small enough focal length to ensure that the 
core is covered by the source image formed by beams with convergence angles 
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@ (acceptance angle) 


Source 


FIG. 8.18. Imaging a light source on to the core of a step-index guide; if the image 
just covers the core and if the cone of rays just matches the acceptance angle of the guide 
the maximum possible flux is transmitted. 


equal to or greater than the acceptance angle. Let the radiance (Section 2.6) of 
the source be B. Then as in Chapter 2 the condenser collects n?Bc?«? watts 
from a portion of the source of diameter 2c. The magnification of the 
condenser is «/0 and if c is equal to a0/a, where 2a is the core diameter, all this 
power will enter the guide. No more power can be collected by the guide from 
a source of radiance B and for the acceptance angle 0: for if we try to image an 
area of diameter 4c onto the core the convergence angle will become a/2 
instead of « in order to adjust the magnification, and no gain will have been 
achieved. A similar point was made in Section 2.6 in connection with the 
luminance of images and in Section 8.1 when the numerical aperture of a 
guide was defined. 

Finally we stress that this calculation is very approximate. First, we have 
already noted that step index fibres transmit skew rays incident at larger 
angles than the formal acceptance angle 0 as given by eqn (8.2), whereas on 
the other hand graded index fibres do not transmit some meridian rays at the 
angle given by eqn (8.9). Secondly the geometrical optics model is an 
approximation and to get a better calculation it would be necessary tO 


calculate the transmission of light according to electromagnetic theory. For 
this, however, we refer to Midwinter (1979). 


Problems 


8.1. A step index light guide has refractive index 1.54 and the cladding has index 1.52. 


Calculate the acceptance angle. What index would be required for the cladding to 
give an acceptance angle of 25°? 


8.2. Show that a guide of circular Cross-section with radius a and acceptance angle 0 
can, within the approximations of Chapter 8, transmit a flux za?Bsin? 0 from a 
Lambertian source of radiance B. 


8.3. A graded index guide has the radial refractive index distribution 1.52— 2r? with r 
in mm up to a radius rg =0.2. Find its acceptance angle and the distance between 
successive intersections of the axis for a meridian ray. 

8.4. A short section of the graded index guide in Problem 8.3 is to be used as an image- 
forming system. How long should it be if (a) an object at infinity is to be imaged at 
the exit end of the guide or (b) if it is to be an afocal system? 


Appendix: the Fourier transform and 
some of its properties 


Definitions 


Let f(x) be a function of the real variable x, single-valued and possibly 
complex in value. The F(u), defined by 


F(u) = f f(x) exp (—i2nux) dx, (A.1) 


is the Fourier transform of f (x). It can be shown that a reciprocal relationship 
then holds, 


f(x) = | F(u) exp (i27xu) du, (A.2) 


so that f (x) is said to be the inverse Fourier transform of F (u). We think of the 
functions f(x) and F(u) as existing in two different regions or domains, the 
x-domain and the u-domain, and the Fourier transformation links pairs of 


functions in these domains. For example, let f(x) be defined by 
f(x)=1, |x| <a/2, 
(A.3) 
f(x)=0, |x|>a/2. 


This is the rectangle function, written rect(x/a). Then by elementary integ- 
ration we find for its transform, 


F(u) =|alsinc (au), (A4) 
where the sinc function is defined by 


sin 70 


no 


This fundamental pair of functions is illustrated in Fig. A.1, and the relation 
between them is sometimes written in the form 


sinc 0 = (A.5) 


la| sinc au=rect (x/a). (A.6) 
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1-0) 
rect (x) 
m 0 1 
x 
1-0) 1-04 
rect (2x) 4 sinc (ju) 
ai 
=f 0 i —6 -4 0 2 4 6 
RS u 


FIG. A.1. The fundamental Fourier-transform pair, the rectangle function, and the 
sinc function, showing the effect of a change of scale. 


This notation obscures the distinction between direct and inverse transforms 
but this is often unimportant in physical applications. Figure A.1 shows, 
through the scale factor a, how a spreading of one function (a increasing in 
rect (x/a)) causes the transform to be compressed along the u-axis. 


Two-dimensional transforms are defined similarly in terms of functions of 
two variables, 


F(u, v) = IEG y) exp {i2x(ux + vy)} dx dy, 
= (A.7) 
f(x, y)= ff F(u, v) exp {2zi(xu +yv)} du dv, 
and the theorems given below for 


one-dimensional functions apply mutatis 
mutandis to two dimensions. 


(A.2) and a factor (27)~! then appears i 
or again there may be a factor (27) 1/2 
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The delta function 


Fourier transforms can be defined for a great variety of functions, although a 
discussion of the conditions under which any given function can have a 
Fourier transform is beyond the scope of this Appendix. If f(x) is a constant, 
say b, the integral in eqn (A.1) does not converge, and so the transform of a 
constant is not defined. However, if in eqn (A.6) the constant a is very large 
then the right-hand side is unity over this large range, and the left-hand side 
becomes correspondingly narrower and higher, as in Fig. A.2. In the limit as a 
tends to infinity the left hand side of eqn (A6) tends to an infinitely narrow and 
infinitely high spike. This is one way of developing a definition of the delta 


ie 


4 sinc (4u) 


FIG. A.2, 
Increases the 
can also be 


Evolution of the delta function as the transform of a constant. As a 
transform of rect (x/a) tends to a delta function. Other pairs of transforms 
used to define the delta function by a similar limiting process. 
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function 6(u), introduced by P. A. M. Dirac. Thus we have the transform 
relationship 


a=a (u), (A.8) 
where ais a constant. There are many situations in physics where a quantity is 
constant for a long time or distance, so that its transform is very sharp and 
narrow, almost a delta function. Thus it is the limiting process implied in eqn 


(A.8) which is important in physical applications of Fourier-transform 
theory. 


Elementary properties 


If we have two pairs of functions which are transforms then any linear 
combination can make a transform pair, 
aF(u) + bG(u)=af(x) + bg(x). (A.9) 


A shift of origin in one domain corres 


ponds to multiplication by acomplex 
exponential in the other domain, 


f(x +a)=exp (i2xau) F(u), 
(A.10) 

F(u—a)=exp (i2nax) f (x). 
If the above result is applied to the delta function and its transform we obtain 
5(u—u)=exp (i2nupx), (A.11) 


ie. the transform of a complex exponential is a delta function with shifted 
origin. This result enables 


us to write down the transform of a periodic 
function. Thus if 


fx) = a cos 2nugx 


=4a exp (i2nugx)+4a exp (—i2nu9x) 
we find from eqn (A.11) 
F(u)=4a 6(u-uy)+4a 5(u+ uo). 
This can then be extended to a sum of periodic functions, e.g. a Fourier series, 
S(x)=Za, exp (i2znx), 
F(u) = a, 6(u— u,). 
A useful property of the delta function is that it encloses unit area, 
b 
f ôļu— uo) du = l,a<uy<b 


a (A.12) 
=0, uo <a or b<up. 
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Theorems 
If f(x) and F(u) are a transform pair then 
Í |Fu)|? du = f V dx, (A.13) 


which is Parseval’s theorem. 
The convolution of two functions f(x) and g(x) is defined to be 


Soyreatx) = | F(s—x)atx) dx 
2A (A.14) 
= { f@gls-x) dx. 
The convolution theorem states that the transform of the convolution of two 
functions is the product of their transforms, 


S (x) gx)=Flu)Glu), 
or, more explicitly, 
Í exp (~i2na)} | S(s—x)g(x) ax} ds=F(u)G(u). (A15) 


Thus convolution in one domain corresponds to multiplication in the other. 
The autocorrelation function of f(x) is defined as 


fete feo = | SEIS) dx, (A.16) 
where f*(x) is the complex conjugate of f(x); but sometimes it is more 


Convenient to use a normalized autocorrelation function, in which case the 
right-hand side of eqn (A.16) is divided by the normalizing constant, 


f Iœ]? dx. 


-o 


The autocorrelation theorem or Wiener-Khintchine theorem states that the 
transform of the autocorrelation of a function is the squared modulus of its 
transform, 


æ æ 


[exp izro] [sessi a} ds=|F). (A17) 


=a 
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Optical analogies 


The results given in this Appendix are mathematical expressions of many 
physical effects. For example, we saw in Section 3.4 that the complex 
amplitude in the far-field diffraction pattern of an aperture is the Fourier 
transform of the complex-amplitude distribution in the aperture. In a slightly 
different context, the same result means that the complex amplitude in the 
point spread function of a lens is the transform of the complex-amplitude 
distribution in the exit pupil of the lens (Section 5.5). Again, the complex 
amplitude of a plane wave striking an aperture in a plane screen normally is 
represented by a constant over the aperture, so that the far-field pattern tends 
to a delta-function shape (i.e. a narrow high peak) as the aperture gets wider. 
This illustrates the scaling rules given in Section 3.4, 

The image of an extended object formed in incoherent illumination is the 
convolution of the light-intensity distribution in the point spread function of 
the lens with that in the object. The convolution theorem tells us that in the 
transform domain this corresponds to multiplying the spatial frequency 
distribution in the object by the optical transfer function to obtain the spatial 
frequency distribution in the image (Section 5.5). 

The fringe function of a two-beam interferometer is the transform of the 
tensity distribution in the Spectrum of the light, or its power spectrum 
(Section 6.2). The autocorrelation function (with time as the variable) of 
intensity of a polychromatic beam of light is, by the autocorrelation theorem, 
the transform of the square of the Power spectrum of the light (Section 6.7). 

Note that in the above summary we have omitted scaling and normalizing 


factors, etc. which would be needed for statements of the results in forms 
suitable for numerical calculation. 


in 


t Fourier transform methods can sometimes 
Stent results, particularly in the use of delta 
self is purely mathematical. For instance in 
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Answers to numerical problems 


In some answers only one or two significant figures are given. This indicates that only 
that precision is physically meaningful, or that the initial data have only that precision. 


Ll. 
1.6. 


29. 
2.10. 


300 m, 300 mm, 300 pm, 300 nm. 
50 pm, 6 x 10!? Hz. 


1.7. Curve 2, 10 per cent; curve 3, 5 per cent. 


-LE 


Focal length r/2; image positions — 50 mm, 100 mm, 66.7 mm; magnifications 
2-1 =% 


5 . Pee! P 2 
- (a) az W mm™?sr™! (in this solution it is assumed that the total area is 10 mm 


and the filament radiates uniformly in all directions) (b) 0.03 W. 


» 10175": 
. 30 mm x 2.5. 


570 m. 

140 mm7!. 
0.8 nm. 

10 m. 

2 mm. 


- 0.5 km. 


(a) 0.985; 0.707; 0.035. (b) 0.970; 0.500; 0.00122. 

(a) 1; (b) 4; (c) 1.987. 

56.3°, 58.0°, 62.2°. 

3.4 um. 

6.7 x 1076 rad, 6.7 x 1077 rad, 1.3 x 1077 rad (the wavelength is taken as 
550 nm). 

7 um in diameter; 2 x 1073 rad mm~!, 

7mm. 

0.5 um, 500. 

(a) NA 0.25, x 125, (b) NA 1.3, x 1000. (The Suggested answers give NA and 
= gua about 4 times larger than needed for resolution of the dimensions 
given). 

(a) 0.546 mm, (b 
0.04 arcsec. 

6.5. 64 = 0.002 nm; ôv = 1.5 x 10° Hz. 
200 nm to 400 nm. 


) 0.13 mm; (c) the fringe contrast would decrease. 


. 6 x 10°, 0.025 nm. 


12 mm diameter. 
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6 um. 

(a) 7.9; (b) 5.6; (c) 0, but fringes will appear if the surface is viewed obliquely. 
The beam intersect at +18.45° to the normal to the grating; 3160 rulings per 
millimetre. 

14.32°. no = 1.481. 

29.12°. 1.937 mm. 

(a) 0.968 mm; (b) 1.937 mm, or any multiple of this distance. 


Index 


Abbe, Ernst, 117 Coherence time, 12 

Aberrations, 32 Collimator, 24 

Acceptance angle, 122 Communication, light guides used for, 
Accommodation, 73 129 

Achromatic doublet, 33, 34 
Afocal system, 34, 35 

Airy, G. B., 51 

Airy pattern, 51 

Angle of diffraction, 89 

Angular dispersion, 90 

Angular magnification, 69 
Angular resolution of the eye, 74 
Anisotropic medium, 62 
Antireflection coating, 99 
Aperture stop, 30 

Astigmatism, 33 

Atmospheric turbulence, 70 
Attenuation, 133 
Autocorrelation, 141 


Complex amplitude, 4, 5, 6 
Complex exponential notation, 4 
Concave grating, 89 

Condenser, 135 

Conjugate distance equation, 23 
Conjugates, 23 

Contrast (of fringes), 41 
Convergence angle, 27 
Convolution, 78, 141 

Cornea, 73 

Corrector plate, 72 

Correlation spectroscopy, 102 
Critical angle, 20 

Crosstalk, 131 

Crown glass, 33 

Curvature, 22 


Bandwidth, 130 
Beam-expander, 53 
Beam-splitter, 38, 83 


Birefringence, 65 De ee 75 
Bit rate, 133 » 133 
Brewster angle, 62 Delta function, 138 


Detector, quantum, 8 
Detector, thermal, 8 


} Differentiating filter, 119 
Calcite, 63 A Diffraction, Ch. 3 
Camera objective, 86 Diffraction at an edge, 55 
Carrier, 130 


1 Diffraction grating, 87 
Cassegrain telescope, 72 


i p Diffractometer, 53 
Chromatic aberration, 33 Diffuser, 106 

Circular aperture, 51 Digital transmission, 132 
Circular polarization, 60 Dispersion, 33, 133 
Cladding of a light guide, 122 Dispersion of a grating, 90 
Coherence length, 12, 86 


Dispersion of a prism, 91 
Coherence patch, 82 Double refraction, 63 


Effective number of interfering beams, 98 
Electromagnetic spectrum, 1 
Electromagnetic wave, 1 
Electron lens, 75 

Elliptically polarized light, 60 
Entrance pupil, 69 

Entrance slit, 89 

Etendue, 135 

Exit pupil, 69 

Exit slit, 89 

Extended object, 76 
Extended source, 40 

Eye, 73, 74 

Eyepiece, 69 


Fabry-Perot interferometer, 95-98 
Faceplate, fibre-optics, 127 
Faraday effect, 65 

Far field, 24, 47, 48 

Far-field diffraction, 49 

Fermat's principle, 19, 124 
Finesse, 97 

Flint glass, 33 

Flux accepted by a guide, 123 
Focal length, 23, 31 

Focal ratio, 72 

Focus, 23 

Focusing by a graded index guide, 124 
Fourier plane, 117 

Fourier transform, 50, Appendix 
Fourier transform spectroscopy, 86 
Fraunhofer diffraction, 49 

Free spectral range, 98 

Frequency, 1 

Fresnel, A., 45 

Fresnel integral, 53 

Fringe function, 85 

Fringe maximum, 56 


Gabor, D., 109 

Galilean telescope, 35 
Gaussian optics, 25 
Geometrical optics, Ch. 2 
Geometrical wavefront, 17 
Graded index guide, 124 
Grating monochromator, 89 
Grating spectrograph, 89 


Hologram, 109 
Hologram interferometry, 111 


Index 147 


Holographic diffraction grating, 114 
Holography, 106-111 

Huygens, C., 45 

Huygens’ secondary wavelets, 44 


Iceland spar, 63 

Image, optical, 21-23 
Image transport, 126 
Impulse function, 133 
Incoherent illumination, 77 
Infinity, object at, 28 
Infrared, 8 

In-plane displacement, 113 
Intensity, 6, 7 

Interference, 4, 12, Ch. 3, 65 
Interference fringe, 37 
Interferometers, Ch. 6 


Invariant, 30 
Iris, 73 
Isophot, 47 


Kerr effect, 65 
Kirchhoff, G., 45 
Kirchhoffs formulation of diffraction, 45 


Lagrange invariant, 29, 30 

Laser beam, 104-105 

Laser beam waist, 105 

Laser light, 11, Ch. 7 

Lens, 21 

Light gathering power, 69 : 

Light gathering power of spectroscopic 
systems, 90 

Light guides, Ch. 8 

Linearity (of a detector), 9 

Logarithmic response, 74 

Lumen, 9 

Luminance, 30 


Magnesium fluoride, 94 
Magnification, 24, 25, 71 
Matter waves, 14 
Maupertuis, P. de, 19 
Mechanical stress, 65 
Meridian ray, 121 
Michelson’s interferometer, 83 
Microscope, 74-76 
Microscope objective, 75 
Minimum deviation, 91 
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Minimum resolvable separation, 118 
Mirror, in raytracing, 29 
Modal dispersion, 133 

Mode, single, 104 
Modulation, 130 

Modulation spectrometer, 101 
Monochromatic light, 11 
Monochromator, 89 
Multi-element lenses, 26 
Multilayer, 100 

Multiple beam interference, 93 
Multiplexing, 100 


Near-field diffraction, 52-55 
Newton's conjugate distance equation, 
29 


Newton’s rings, 41 

Nicol prism, 58 

Noise, 134 

Nonlinear optics, 4 

Nonlinear response of the eye, 74 
Numerical aperture, 76 

Numerical aperture of light guide, 122 


Object, 22 

Objective, 68 

Oblique reflection and refraction, 61 

Oil film, 40 

Oil immersion, 76 

Optic axis, 63 

Optical glasses, 33 

Optical path length, 19, 23 

Optical transfer function, 79 

Order of interference, 40 

Orthogonally polarized beams, 64 
TF, 79 


Overlapping orders, 98 


Paraboloidal mirror, 71 
Paraxial optics, 25 

Parseval’s theorem, 141 

Partial coherence, 82 
Persistence of vision, 8 

Phase change (on reflection), 40 
Phase shift, 5 

Photocathode, 10 

Photocell, 8 

Photoelectron, 8 

Photographic emulsion, 9 
Photometry of optical systems, 70 


Photomultiplier, 9 

Photon, 8 

Photons in interference, 56 
Photo-resist, 115 

Planck constant, 8 

Plane grating, 89 

Plane of polarization, 58 
Point image, 37 

Point spread function, 69, 76 
Polarization, Ch. 4 
Polarizer, 58 

Polaroid, 65 

Polychromatic light, 11, 40 
Power in a wave, 3 

Principal planes, 25 
Principal points, 25 

Prism, right-angle, 21 

Prism spectrograph, 86 
Pupil functions, 76 

Pupils of an optical system, 69 


Quantum detection process, 8 
Quantum efficiency, 9 
Quarter wave plate, 64 


Radiance, 30 

Radio waves, 3, 9 

Randomness (in a light beam), 10 
Ray, 16 

Ray surface, 63 

Raytracing, paraxial, 26 
Rectangle function, 137 

Red cadmium line, 86 

Reflection, law of, 17 

Reflection factor, 61 

Refraction, law of, 17 

Refractive index, 19 

Repeater, 129 

Resolution, 70 

Resolving power of a grating, 91 
Resolving power of a prism, 93 
Resolving power of a Fabry-Perot, 96 
Response time, 8 

Retardation, 64 

Retina, 73 


Saccharimeter, 65 


Scanning interferometer, 97 
Schmidt camera, 72 


Signal-to-noise ratio, 135 


Sinc function, 47, 137 

Skew ray, 122 

Snell's law, 18 

Source, thermal, 7 

Spatial filter for differentiation, 119 
Spatial filtering, 115 

Spatial frequency, 78 

Spatial frequency components, 116 
Speckle, 105 

Spectral sensitivity, 9 
Spectrograph, 87 

Spectroscopes, Ch. 6 
Spectroscopic grating, 89 
Spectrum, electromagnetic, 1 
Spectrum line, 91 

Speed of light, 1, 14-15 

Stationary light path, 19 

Stellar interferometer (of Michelson), 82 
Step index guide, 124 

Straight edge (diffraction), 43 
Superposition, 4 


Taylor, G. I., 56 

Telescope, 68-73 

Telescope, Newtonian, 71 
Telescope, Cassegrain, 72 
Telescope, Schmidt, 72 
Temporal coherence, 83 
Thermal detection process, 8 
Thermal light, 7 


Index 149 


Thin films, 98-99 

Thin lens, 22 

Total internal reflection, 20 
Transfer function, 133 
Two-beam fringes, 39 
Two-dimensional transform, 138 


Ultraviolet, 8 
Unpolarized light, 61 


Virtual image reconstruction, 110 
Visibility (of fringes), 41 
Visual observation, 69 


Wave, electromagnetic, 1 
Wavefront, 5, 17, 22 
Wavelength, 1 
Wave-number, 5 
Wave-vector, 5 

White light, 11 


Young, T., 81 
Young’s experiment, 81 


Zero order, 88 
Zero path difference, 86 
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Physical constants and conversion factors 


Avogadro constant Lor Ny 6.022 1073 mol”! 
Bohr magneton Hp 9.274 x 10-74 J T7! 
Bohr radius ao 5.292 x 107!! m 
Boltzmann constant k 1381x1073] K= 
charge of an electron e —1.602 x107" C 
Compton wavelength of electron 4,=h/me=2-426 x 107!? m 


Faraday constant 


F 9.649 x 10* C mol~™’ 
fine structure constant a= poe*c/2h=7.297 x 107 3(4-! = 137.0) 
gas constant R 8.314 J K~! mol`! g 
gravitational constant G 6.673 x 107}! N m? kg~? 
nuclear magneton Hy S051 «10727 jT 
permeability of a vacuum Ho 4xx 10-7 H m~! exactly 
permittivity of a vacuum & 8.854 x 107'? F m~! (1/4209 = 

8.988 x 10° m F`!) 
Planck constant h 6.626 x 10-34 J s ts 
(Planck constant)/2z hi 1.055 x 10734 J s=6.582x 10° '° 
eVs 

Test mass of electron m, 9.110 x 107°! kg=0.511 MeV/c? 
rest mass of proton mp 1.673 x 107?7 kg=938.3 MeV/c? 
Rydberg constant R, =u metc?/8h? =1.097 x 107 m=! 
speed of light in a vacuum c 2.998 x 10° m s~! 
Stefan-Boltzmann constant o =2n°k*/15h3c? = 5.670 x 107ë W m~? K~* 
unified atomic mass unit (C) u 1.661 x 10-77 kg=931.5 MeV/c? 
wavelength of a 1 eV Photon 1.243 x 1076 m 


1 dyne=10-5 N; 1 gauss G)=10>4 tesla (T); 
=273.15 K; 1 curie (Ci)=3.7 x 1010 aa i — 

1 J=10? erg=6.241 x 10!8ey: leV=1.602 x 10-19 J; 1 calp =4.184 J; 
In 10=2303; In x=2303 log x: 5 f 


€=2718; log e=0.4343; n=3.142 


This book covers basic classical optics — geometrical 
and physical — ata level suitable for the first or 
second year of a degree course. The first chapter 
discusses the properties of the electromagnetic waves 
in the optical region and compares them with other 
regions of the spectrum. The following chapters deal 
with the geometrical optics model, the propagation of 
waves, polarization and optical instruments. The final 
chapters on laser light and optical fibres show the 


application of the ideas developed earlier in the book. ` 


The text also shows how the properties of light fit into 
the general scheme of physics, and how these 
properties are used in many of the instruments that are 
among the basic tools of science. 
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